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MOLECULAR CLONES WITH MUTATED 
HIV GAG/POL, SIV GAG AND SIV ENV GENES 

L TECHNICAL FIELD 

The invention relates to nucleic acids comprising mutated HTV-1 
gag/pol and SIV gag gene sequences which are capable of being expressed 
independently of any SIV or HTV regulatory factors. The invention also relates to 
nucleic acids comprising a mutated SIV env gene sequence, which is capable of 
being expressed independently of any SIV or HTV regulatory factors. The preferred 
nucleic acids of the invention are capable of producing infectious viral particles. 

The invention also relates to vectors, vector systems and host cells 
comprising the mutated HIV-1 gag, HIV-1 pol and/or SIV gag gene sequences. The 
invention also relates host cells comprising these nucleic acids and/or vectors or 
vector systems. The invention also relates to the use of these nucleic acids, vectors, 
vector systems and/or host cells for use in gene therapy or as vaccines. 

H. BACKGROUND 

Until recently, gene therapy protocols have often relied on vectors 
derived from retroviruses, such as murine leukemia virus (MLV). These vectors are 
useful because the genes they transduce are integrated into the genome of the target 
cells, a desirable feature for long-term expression. However, these retroviral vectors 
can only transduce dividing cells, which limits their use for in vivo gene transfer in 
nonproliferating cells, such as hepatocytes, myofibers, hematopoietic stem cells, and 
neurons. 

Lentiviruses are a type of retrovirus that can infect both dividing and 
nondividing cells. They have proven extremely efficient at providing long-term 
gene expression (for up to 6 months) in a variety of nondividing cells (such as, 
neurons and macrophages) in animal models. See, e.g., Amado et al., Science 
285:674-676 (July 1999). It has been proposed that the optimal gene transfer 
system would include a vector based on HIV, or other lentivirus, that can integrate 
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into the genome of nonproliferating cells. Because retroviruses integrate in the 
genome of the target cells, repeated transduction is unnecessary. Therefore, in 
contrast to an adenoviral vector capable of in vivo gene delivery, problems linked to 
the humoral response to injected viral antigens can be avoided. See, e.g., Naldini et 
al. s Science, 272:263-267 (1996), p. 263. 

HTV and other lentiviruses have a complex genome that, in addition 
to the essential structural genes (env, gag, and pol), contains regulatory (tat and rev) 
and accessory genes (vpr, vif, vpu, and nef). HIV has evolved to efficiently infect 
and express its genes in human cells, and is able to infect nondividing cells such as 
macrophages because its preintegration complex can traverse the intact membrane 
of the nucleus in the target cell. This complex contains, in addition to the viral 
DNA, the enzyme integrase, the product of the vpr gene, and a protein encoded by 
the gag gene called matrix. The matrix protein enables the preintegration complex 
to pass into the nucleus to access the host DNA. Lentiviruses cannot efficiently 
transduce truly quiescent cells (cells in the Go state). However, unlike murine 
retroviral vectors, in addition to being able to infect dividing cells, HTV-based 
vectors can achieve effective and sustained transduction and expression of 
therapeutic genes in nondividing cells, such as hematopoietic stem cells and in 
terminally differentiated cells such as neurons, retinal photoreceptors, muscle, and 
liver cells. See, e.g., Amado et al. (July 1999) and Klimatcheva et al., Frontiers in 
Bioscience 4:d48 1-496 (June 1999), and the references cited therein. 

Although lentiviral vectors can be efficient gene delivery vehicles, 
there are safety concerns due to their origin. Therefore, the field has turned its 
attention to the development of vectors and production systems with built-in safety 
features to prevent the emergence of replication competent lentivirus (RCL). For 
example, in most laboratory applications, lentiviral vectors are generally created in a 
transient system in which a cell line is transfected with three separate constructs: a 
packaging construct, a transfer construct, and an envelope encoding construct. The 
packaging construct contains the elements necessary for vector packaging (except 
for env) and the enzymes required to generate vector particles. The transfer 
construct contains genetic cis-acting sequences necessary for the vector to infect the 
target cell and for transfer of the therapeutic (or reporter) gene. The lentivirus env 
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gene is generally deleted from the packaging construct and instead the envelope 
gene of a different virus is supplied in a third vector "the env-coding vector", 
although the lentiviruses env gene may be used if it is desired that the vector be 
intended to infect CD4*T cells. A commonly used envelope gene is that encoding 
the G glycoprotein of the vesicular stomatitis virus (VSV-G), which can infect a 
wide variey of cells and in addition confers stability to the particle and permits the 
vector to be concentrated to high titers (see, e.g., Naldini et al., Science 272:263-267 
(1996) and Akkina et al. J. Virol 70:2581 (1996). The use of three separate 
constructs and the absence of overlapping sequences between them minimizes the 
possibility of recombination during lentivirus (transfer) vector production. In 
addition, because no viral proteins are expressed by the lentiviral (transfer) vector 
itself, they do not trigger an effective immune response against cells expressing 
vector in animal models (a particular problem with vectors based on adenovirus). 
See, e.g., Amado et al., Science 285:674-676 (July 1999) and the references cited 
therein. See also Naldini et al. Science 272:263-267 (1996). 

The initial packaging plasmids contained most HIV genes except for 
env. In an effort to improve safety, subsequent HIV vectors have been produced in 
which the packaging plasmid is devoid of all accessory genes. This process does 
not interfere with efficient vector production and significantly increases the safety 
of the system because potential RCLs lack the accessory genes necessary for 
efficient replication of HIV in humans. Although these vectors can transduce 
growth-arrested cell lines and neurons in vivo, they have been reported to not 
efficiently transduce macrophages. The accessory gene vpr is believed to be 
necessary for HTV infection of these cells using these HIV vectors. See, Zufferey et 
al., Nature Biotechnol. 15:871-875 (1997). In contrast, as discussed later herein, 
the HIV-based lentiviral vectors of the present invention do not need any HTV 
accessory genes in order to be able to infect human macrophages and the other cells 
tested. 

The requirement of vpr or vifiox efficient transduction of liver cells 
has also been reported. See, e.g., Kafri et al., Nature Genet. 17:314 (1997). These 
results indicate that the requirement of accessory genes for efficient lentivirus- 
mediated gene transfer is dependent on the type of cell chosen as target, suggesting 
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that future applications of lentiviral vectors may involve vector constructs with 
different accessory genes, as needed. 

Zufferey et al., (1 997) describe an HTV vector system in which the 
virulence genes, em, vif, vpr, vpu t and we/have been deleted. This multiply 
attenuated vector conserved the ability to transduce growth-arrested cells and 
monocyte-derived macrophages in culture, and could efficiently deliver genes in 
vivo into adult neurons. The packaging plasmids described Zufferey et al. (1997) 
andNaldini et al. (1996) encode Rev and Tat, in addition to Gag and Pol. 

Lentiviral vectors engineered to become packaged into virions in the 
absence of the regulatory gene tat have also been described. See, e.g., Kim et al., J. 
Virol. 72:811-816 (1998) and Miyoshietal. J. Virol. 72:8150-8157 (1998). In 
these vectors the tat gene has been removed from the packaging plasmid. Kim et al. 
state that tat is not necessary as long as the serial 5' LTR promoter is replaced with 
a strong constitutive promoter. It also has other advantages for HTV therapy. 
Replacement of the HIV-1 LTR with a constitutive HCMV promoter permits the use 
of anti-Tat molecules such as Tat transdominant mutants or Tat activation response 
element decoys as therapeutic agents, since they will not affect vector production, 
(see p. 814, col 2). The removal of the tat gene eliminates an essential virulence 
factor that could contribute to a possible RCL. Kim et al. (1 998) describe a. vector 
system which does not contain tat, vif vpr, vpu and nef The preferred vector 
system includes the rev gene which, the authors state 'Svith RRE, is required for 
efficient RNA handling in this system." (p. 81 1, col. 2). However, Kim et al. also 
constructed Rev independent constructs using CTE. Kim et al. state that the 
rev/KRE components could be removed by using a sequence such as the Mason- 
Pfizer monkey virus (MPMV) constitutive transport element (CTE), thereby 
eliminating all accessory proteins, but this leads to a significant reduction in titer. 

Srinivasakumar et al., J. Virol. 71:5841-5848 (1997) describes the 
generation of stable HIV- 1 packaging lines that constitutively express high levels of 
HIV-1 structural proteins in either a Rev-dependent or a Rev-independent fashion. 
These cell lines were used to assess gene transfer by using a HTV-1 vector 
expressing the hygromycin B resistance gene and to study the effects of Rev, Tat, 
and Nef on the vector titer. The Rev-independent cell lines were created by using 
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gag-pol and env expression vectors that contain the MPMV CTE. This article 
describes the construction of four plasmids, among others: CMV gagpol-RRE and 
pCMVenv, which require Rev coexpression for HTV-1 structural gene expression, 
and pCMV gagpol-CTE and pCMVenv-CTE, which do not. To create Rev- 
containing and Rev-independent packaging, cell lines, CMT3 cells were transfected 
with vectors expressing Gag, Gag-Pol, and Env, using a calcium phosphate 
transfection procedure. 

By creating an HIV vector which contained the MPMV CTE 
(pTR167-CTE) and a packaging cell line which expressed the HIV structural 
proteins in a Rev-independent fashion, the authors were able to obtain a HTV vector 
system that functions completely without Rev. The titer of the vector obtained from 
this system was essentially the same as that obtained from a parallel system which 
contained Rev. The authors state that, in this context, the CTE seemed to substitute 
completely for Rev-RRE functions, similar to what was previously observed in 
transient-expression assays with Rev-dependent constructs. This is in contrast to 
situations where several rounds of HTV replication were measured. In those cases, 
titers from CTE-containing viruses were always reduced by at least 1 log unit 
compared to viruses utilizing Rev and the RRE. (See, Srinivasakumar et al., p. 
5847). 

The authors state that the advantages of having a HIV vector system 
that works in the absence of Rev opens the possibility of using it as a delivery 
vehicle for intracellular immunization against Rev function. Genes encoding Rev 
antagonists that have dramatic inhibitory effects on HIV replication, such as Rev 
M10 or RRE decoys, could be introduced into an HTV vector and put into cells 
normally infectable by HIV. Expression of the "anti-Rev" gene would be expected 
to dampen HIV infection. Any residual HIV replication should lead to activation of 
the vector LTR (by Tat) and create a vector-derived RNA that would be packaged 
by proteins derived from the infectious virus. In this scenario, the wild-type virus 
would act as a helper that may allow the spread of vector particles to previously 
nonimmunized cells. Because of the additional vector spread, it is likely that this 
type of scheme will be more effective in modulating HTV infection in vivo than one 
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based on traditional retrovirus vectors. The authors state that they are currently 
testing this approach in model systems. (See, Srinivasakumar et al, p. 5847). 

Another development in the quest for a safe system is the so-called 
self-inactivating (SIN) vector. See, e.g., Yu et al., Proc Natl Acad Sci USA 
83:3194-8 (1986) and Miyoshi et al., J. Virol. 72:8150 (1998). In Yu et al, a 
retrovirus-derived vector SIN vector was designed for the transduction of whole 
genes into mammalian cells. The SIN vector of Yu et al. contains a deletion of 299 
base pairs in the 3' long terminal repeat (LTR), which includes sequences encoding 
the enhancer and promoter functions. When viruses derived from such vectors were 
used to infect NIH 3T3 cells, the deletion was transferred to the 5 f LTR, resulting in 
the transcriptional inactivation of the provirus in the infected cell. Introduction of a 
hybrid gene (human metallothionein-promoted c-fos) into cells via a SIN vector was 
not associated with rearrangements and led to the formation of an authentic mRNA 
transcript, which in some cases was induced by cadmium. The vector described in 
Miyoshi et al. also contains a deletion the 3' (downstream) LTR. A sequence within 
the upstream LTR serves as a promoter under which the viral genome is expressed. 
The deletion introduced in the downstream LTR is transferred to the upstream LTR 
during reverse transcription. This deletion inactivates the LTR promoter and 
eliminates the production of vector RNA. The gene (or genes) to be transferred 
(e.g., a reporter or therapeutic gene) is expressed from an exogenous viral or cellular 
promoter that is inserted into the lentivirus vector. An important safety feature of 
SIN vectors is that inactivation of the promoter activity of the LTR reduces the 
possibility of insertional mutagenesis (of the transfer vector) into the host genome. 
In addition, because the expression of the (transfer) vector RNA is eliminated, the 
potential for RCL production in the target cell is further minimized. SIN vectors 
should be particularly useful in gene transfer experiments designed to study the 
regulated expression of genes in mammalian cells. Absence of enhancer and 
promoter sequences in both LTRs of the integrated provirus should also minimize 
the possibility of activating cellular oncogenes and may provide a safer alternative 
to be used in human gene therapy. Other modifications to enhance safety and 
specificity include the use of specific internal promoters that regulate gene 
expression, either temporally or with tissue or cell specificity. 
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Other strategies to improve safety in human studies would be to use 
nonhuman lenti viruses such as simian immunodeficiency virus, bovine 
immunodeficiency virus, or equine infectious anemia virus. Of these, vectors 
derived from the feline immunodeficiency virus have been engineered to efficiently 
transduce nondividing human cells. See, e.g., Poeschla et al., Nature Med. 4:354- 
357 (1998) and WO 99/15641 . In addition, White et al., J. Virol. 73:2832-2840 
(April 1999) described lenti viral vectors using human and simian immunodeficient 
virus elements in attempt to improve safety by reducing the likelihood of 
recombination between packaging constructs and transfer constructs. 

The development of efficient packaging lines has proven challenging 
because expression of the VSV-G envelope and a number of HTV proteins is toxic to 
cells. Recently, a producer line has been designed in which the expression of 
packaging genes and VSV-G, and therefore the production of vector, can be turned 
on at will. Kafiri etal, J. Virol. 73-576-584 (1999). The cell line can be expanded 
for scale-up vector production when the expression of toxic genes is turned off. 
This cell line produces high titer vector without generating RCL. Hematopoietic 
stem cells transduced with an HTV vector were transplanted into rhesus macaques as 
described by Donahue et al. Blood 92 (suppl. 1), abstract 4648.5 (1998) with at least 
a 14-month follow-up. At that time the procedure proved to be safe; all animals in 
the study have remained healthy without evidence of circulating HIV or vector. 
See, Amado et al., Science 285:674-676 (July 1999). 

Many gene therapy protocols have been designed to correct a number 
of inherited metabolic, infectious, or malignant diseases using the hematopoietic 
stem cell. This cell has the capacity to self-renew and to differentiate into all of the 
mature cells of the blood and immune systems. Many diseases that affect these 
systems could potentially be treated by the stable introduction of therapeutic genes 
into stem cells. Recently, lenti viral vectors were shown to bypass the need for ex 
vivo stem cell stimulation (which is necessary when using murine retroviral 
vectors), by mediating efficient gene transfer into very primitive human stem cells 
that contributed to stable, long-term reconstitution of SCID mouse bone marrow 
with many hematopoietic lineages. See, e.g., Miyoshi et al., Science 283:682 
(1999). Similarly, in a rhesus macaque model of autologous transplantation with 
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lenti virus-transduced stem cells, multilineage gene expression was found, 
suggesting transduction of an early blood cell progenitor under conditions of 
minimal stem cell stimulation, ordinarily insufficient for transduction with murine 
retroviruses. See, Donahue et al., Blood 92 (suppl. 1), abstract 4648.5 (1999) and 
Amado et al., Science 285:674-676 (July 1999). 

In HTV infection, another advantage of lentiviral vectors designed 
against HTV is their potential to be mobilized by HIV in the infected patient, 
because the virus supplies all of the necessary elements for packaging of the vector. 
If these mobilized vectors contained the HTV envelope, they could efficiently 
transfer their genes (for example, genes custom-designed to confer resistance 
against HTV) into CD4 + T cells, protecting them from subsequent HTV infection. 
Lentiviral vectors can also be designed to efficiently express their genes only in 
CD4 + T cells that are infected with HTV (so called /^-inducible vectors). In these 
vectors, all HIV genes, including tat and rev, are ablated; cis-acting sequences 
required for integration, expression, and packaging are retained, and expression is 
dependent on the activity of the HIV LTR (which requires transactivation by Tat). 
It has been shown that in this system, vector expression is induced efficiently upon 
HIV infection. Moreover, in the absence of genes that confer resistance against 
HTV, stable integration of this vector in permissive cell lines resulted in inhibition of 
HIV replication. Although the mechanism of HTV inhibition has not been 
completely elucidated, preliminary results suggest that this vector competes with 
HIV at the level of reverse transcription. See, An et al., J. Virol., in press, and 
Amado et al., Science 285:674-676 (1999). 

A number of other potential medical applications, where the 
modification of the genetic material of quiescent cells could result in the prevention 
or reversal of a disease process, are beginning to be explored. For example, the 
finding that lentiviral vectors can mediate stable and long-term gene transfer by 
direct injection of vector into the rat and mouse retina has lent support to the notion 
of gene therapy for the treatment of retinitis pigmentosa. This degenerative disease 
of the retina is characterized by photoreceptor cell death, resulting in a slow 
progression to blindness. Mutations in the cGMP phosphodiesterase P subunit 
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(PDEP) gene of rod photoreceptors lead to an autosomal recessive form of retinitis 
pigmentosa in humans, and in the rd mouse model of the disease. Previous studies 
have shown that adenovirus and adeno-associated virus-mediated PDEP subretinal 
gene transfer results in a delay in photoreceptor cell death. Using the rd mouse 
model, a recent study demonstrated that photoreceptors could be rescued in up to 
50% of eyes injected with a lentivirus vector containing the murine PDEp gene. In 
contrast with the short-term expression previously obtained with adenovirus vectors, 
PDEp expression in this study persisted for at least 24 weeks. This finding points to 
the potential success of gene therapy in a disease that currently lacks effective 
treatment. See, Takahashi et al., J. Virol, 73:7812-7816 (Sept. 1999) and Amado et 
al. Science, 285:674-676 (1999). 

In nature, the expression of gag, pol y and env of HTV-1 depends on 
the presence of the viral Rev protein. This dependence is, at least in part, due to the 
presence of negatively acting sequences (inhibitory or instability elements [INS]) 
located within unspliced and partially spliced mRNAs. The positive interaction of 
Rev with the Rev-responsive element [RRE] in these mRNAs counteracts the 
negative effects of the inhibitory sequences. 

None of the above references teach or suggest that the gag and/or pol 
genes described therein may be replaced with the gag and/or pol genes in which the 
inhibitory/instability have been mutated to render their expression Rev-idependent. 
Furthermore, there is no disclosure of the specific HIV-1 gaglpol or SIV gag 
mutated genes described herein. 

The gag/pol clone of the invention was made using the method for 
eliminating inhibitory/instability regions from a gene as first described in U.S. 
patent application Serial No. 07/858,747, filed March 27, 1992, (inventors, G. 
Pavlakis and B. Felber) entitled "Method of Eliminating Inhibitory/Instability 
Regions from mRNA" and later described in a Continuation-in-Part ("CIP") 
application, filed as PCT application PCT/US93/02908 on March 29, 1993 and U.S. 
Patent Nos. 5,972,596 and 5,965,726. The disclosure of the CIP application was 
published as International Publication No. WO 93/20212 on October 14, 1993. 
(The disclosures of these patents and patent applications are specifically 
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incorporated by reference herein in their entirety.) The method was also described 
in Schwartz et al., J. Virol. 66:7176-7182 (1992). 

Schneider et al., J. Virol. 71:4892-4903 (1997), extend the work 
described in the patent applications and in Schwartz et al. by identifying and 
characterizing additional INS within gag, protease and pol genes and mutating them 
in a similar manner. Schneider et al. disclose nucleic acid constructs which contain 
completely mutated HIV-1 gag genes, but only partially mutated HTV-1 pol genes. 

Schneider et al. demonstrate that expression vectors containing an 
intact or nearly intact p55 OT region allow the production of immature viral particles 
in mammalian cells in the absence of any other HTV proteins. The introduction of 
additional mutations in the protease region allowed efficient production of 
Gag/protease, which resulted in processing of the Pt55 w precursor and production 
of mature Gag particles with a lentivirus-like conical-core structure. 

Schneider et al. disclose that Rev-independent expression vectors 
allow the efficient expression of Gag proteins in many cell lines that are not able to 
support efficient Rev-RRE-dependent rescue of these RNAs. Schneider et al. also 
disclose that gag/pol expression vectors may be important for vaccination 
approaches against HIV-1, since the gag/pol region is more conserved than is the 
env region and may be important for an effective immune response against HTV and 
for protection against infection. They also state that efficient HTV gene expression 
in many cells is also of interest for possible gene transfer experiments using 
lentiviral vectors in nondividing or slowly dividing cells, since HTV and the other 
lentiviruses are able to infect quiescent cells. 

Pavlakis et al., Natl Conf Hum Retroviruses Relat Infect (2nd). 
(1995), 91, state that Rev-independent Gag expression vectors were able to produce 
viral particles in human and mouse cells in the absence of any other HTV proteins, 
and that additional mutations in the pol region allowed the expression of the 
protease and the processing of the p55 gag precursor. Direct DNA injection of TAT 
and Rev independent Gag expression vectors in mouse muscle resulted in Gag 
expression detected by ELISA and in anti-gag antibody response. Several Rev-and 
Tat- independent Gag expression cassettes were inserted into retroviral vectors and 
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cell lines expressing Gag or Gag fragments that are dominant negative inhibitors of 
HTV-1 were constructed. 

Shiver et al. (1996) describe the results of DNA vaccination of mice 
and non-human primates with mutated plasmid DNA encoding either mutated genes 
encoding HIV-1 gag (p55 gag) or env (gpl20 or gpl60). Both gag and env vaccine 
recipients exhibited antigen-specific cytotoxic and helper T lymphocyte (CTL, Th) 
responses. The results are stated to demonstrate that DNA vaccines elicited long- 
lived T cell responses in both mice and nonhuman primates that were disseminated 
throughout the lymphatics. 

m. SUMMARY OF THE INVENTION 

The invention relates to nucleic acids comprising the nucleic acid 
sequence of the mutated HIV-1 gag/pol gene shown in Figure 1 (SEQUENCE ID 
NO:l) and vectors and vector systems comprising these nucleic acids. 

The invention also relates to nucleic acids comprising the nucleic 
acid sequence of the mutated SIV gag gene shown in Figure 3 and vectors and 
vector systems comprising these nucleic acids. 

The invention also relates to nucleic acids comprising the mutated 
SIV env gene shown in Figure 1 7 and vectors and vector systems comprising these 
nucleic acids. ' 

The invention also relates to products produced by the nucleic acids, 
e.g., mRNA, protein, and infectious viral particles. 

The invention also relates to compositions comprising these nucleic 
acids and/or their expression products. 

The invention also relates to host cells comprising these nucleic 
acids, vector systems or viral particles. 

The invention also relates to uses of these nucleic acids, vector 
systems, host cells, expression products, and/or compositions to produce mRNA, 
proteins, and/or infectious viral particles, and/or to induce antibodies and/or 
cytotoxic or helper T lymphocytes. 

The invention also relates to the use of these nucleic acid constructs, 
vectors, vector systems and or host cells for use in immunotherapy and 
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immunoprophylaxis, e.g., as a vaccine, or in genetic therapy after expression, 
preferably in humans. The nucleic acid constructs of the invention can include or be 
incorporated into lentiviral vectors or other expression vectors or they may also be 
directly injected into tissue cells resulting in efficient expression of the encoded 
protein or protein fragment. These constructs may also be used for in-vivo or in- 
vitro gene replacement, e.g., by homologous recombination with a target gene in- 
situ. 

IV. BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 . DNA sequence of a mutated HIV-1 gag/pol molecular clone 
(SEQUENCE ID NO: 1) ' The gagpol terminator is located at positions 4305-4397 
of SEQUENCE ID NO: 1 . 

Fig. 2. Comparison of the sequence of the wild -type and mutated 
pol region in pCMVgagpolBNkan. Position #1 in the figure is position 2641 in 
plasmid pCMVgagpolBNkan. The comparison starts at position 1872 from the gag 
initiator ATG. 

Fig. 3. DNA sequence of a mutated SIV gag molecular clone 

(SIVgagDX). 

Fig. 4. Comparison of the mutated SIV gag DNA sequence in 
SIVgagDX with the wild type SIV sequence from Simian (macaque) 
immunodeficiency virus isolate 239, clone lambda siv 239-1 (GenBank accession 
No. M33262). 

Fig. 5. Schematic diagram of some components of sample versions 
of a lentiviral system. BGH poly (A): bovine growth hormone poly (A) signal; 
MSD: mutated splice donor site; \|/: encapsidation signal; SD, splice donor site; 
SA, splice acceptor site; "X" indicates that the ATG codon of the partial gag gene 
sequence is mutated so that translation of this gene does not occur. 

Fig. 6. Schematic diagram of the packaging construct 
pCMVgagpolBNkan. 

Fig. 7. Schematic diagram of transfer construct 1 : pmBCwCNluci. 
The packaging signal, the CMV promoter and the coding region for the luciferase 
gene are flanked by the 5* and 3 HTV-1 LTRs, which provide promoter and 
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polyadenylation signals, as indicated by the arrows. Three consecutive arrows 
indicate the U5, R, and U3 regions of the LTR, respectively. The transcribed 
portions of the LTRs are shown in black. Some restriction endonuclease cleavage 
sites are also indicated. 

Fig. 8. Schematic diagram of transfer construct 1 : pmBCmCNluci. 
Symbols are as above. 

Fig. 9. DNA sequence of packaging construct pCMVgagpolBNkan. 

Fig. 10. DNA sequence of transfer construct 1 : pmBCwCNluci. 

Fig. 11. DNA sequence of transfer construct 1 : pmBCmCNluci. 

Figure 12: 

Fig. 12. Nucleotide sequence of the region BssHII (71 1) to Clal 
(830) in wild-type HTV-1 molecular clones HXB2 and NL4-3, and in the transfer 
constructs. The translation initiator signal for Gag protein (ATG) is underlined. 
pmBCwCNluci and pmBCmCNluci (transfer constructs 1 and 2) contain the 
sequence mBCwCN. Transfer construct 3 contains the sequence m2BCwCN. In 
contrast to the sequence mBCwCN, m2BCwCN has different mutations at the 5' 
splice site region arid has an intact Gag ATG. 

Fig. 13. Bar graph showing levels of gag protein that is released 
from cells upon transient transfection with pCMVgagpolBNkan (labeled 
pCMVBNKan in the figure). 

Fig. 14. Bar graph showing reverse transcriptase activity from the 
Rev-independent gag-pol HIV-1 vector pCMVgagpolBNkan (labeled pCMVBNKan 
in the figure). 

Fig. 15. Bar graphs showing the amount of luciferase per nanogram 
of p24 Gag protein detected in cells transducted with PCMVgagpolBNkan Rev- 
independent gag-HIV- 1 based retroviral vectors. The results show that with 
PCMVgagpolBNkan Rev-independent gag-HTV-1 based retroviral vectors display 
high transduction efficiency in (A) 293 cells, (B) human lymphoid cells, (C) human 
myeloid cells (U937), as well as (D) non-dividing cells such as primary human 
macrophages. 

Fig. 16. Schematic diagram of the SIV envelope encoding vector 
CMVkan/R-R-SIVenvCTE. 
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Fig. 17. DNA sequence of the SIV envelope encoding vector 
CMVkan/R-R-SIVenvCTE containing a mutated SIV env gene. 

V. MODES FOR CARRYING OUT THE INVENTION 

It is to be understood that both the foregoing general description and 
the following detailed description are exemplary and explanatory only, and are not 
restrictive of the invention, as claimed. The accompanying drawings, which are 
incorporated in and constitute a part of the specification, illustrate an embodiment of 
the invention and, together with the description, serve to explain the principles of 
the invention. 

One aspect of the invention comprises vectors that encode the Gag 
and/or Pol of HTV-1 in a Rev-independent manner. An example of such a vector 
which is described herein is the plasmid pCMVgagpolBNkan, which encodes the 
complete Gag and Pol of HTV-1 in a Rev-independent manner, and also contains a 
gene conferring kanamycin resistance. This plasmid is Tat and Rev-independent 
and was generated by eliminating the inhibitory/instability sequences present in the 
gag/pol mRNA without altering the amino acid sequence of the proteins coded by 
the genes. 

The gag/pol clone of the invention is a DNA construct of the gag/pol 
region of HIV which has had the inhibitory/instability regions removed. The 
construct is expected to be useful as a component a new type of lentivirus vector for 
use in gene therapy or as a vaccine. 

The gag, pol or gag/pol sequences of the invention can be highly 
expressed in human and other mammalian cells in the absence of any other 
regulatory and structural protein of HTV, including Rev. When the gag/pol 
sequences are combined with a sequence encoding an envelope protein, such as the 
VSV G protein or the HTV envelope protein (e.g., in the same vector or in another 
expression vector), infectious virus is produced after transfection into human cells. 
When a gene encoding a non-HIV envelope protein is used, for example, in the 
presence of the HIV gag/pol gene, the virus particles produced would contains only 
the HIV proteins Gag and Pol. 
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Lentiviral vectors or vector systems based on the gag/pol or gag/pol 
sequences of this invention, as exemplified by the Rev-independent pCMVgagpol 
BNkan construct described herein, may be used for gene therapy in vivo (e.g., 
parenteral inoculation of high titer vector) or ex vivo (e.g., in vitro transduction of 
patient's cells followed by reinfusion into the patient of the transduced cells). These 
procedures are been already used in different approved gene therapy protocols. 

The HTV gag/pol clone and SIV gag clone of the invention were 
made using the method for eliminating inhibitory/instability regions from a gene as 
described in U.S. patent application Serial No. 07/858,747, filed March 27, 1992, 
and also in related U.S. Patent Nos. 5,972,596 and 5,965,726, which are 
incorporated by reference herein. This method does not require the identification of 
the exact location or knowledge of the mechanism of function of the INS. 
Generally, the mutations are such that the amino acid sequence encoded by the 
mRNA is unchanged, although conservative and non-conservative amino acid 
substitutions are also envisioned where the protein encoded by the mutated gene is 
substantially similar to the protein encoded by the non-mutated gene: The mutated 
genes can be synthetic (e.g., synthesized by chemical synthesis), semi-synthetic 
(e.g., a combination of genomic DNA, cDNA, or PGR amplified DNA and synthetic 
DNA), or recombinantly produced. The genes also may optionally not contain 
introns. The nucleic acids of the invention may also contain Rev-independent 
fragments of these genes which retain the desired function (e.g., for antigenicity of 
Gag or Pol, particle formation (Gag) or enzymatic activity (Pol)), or they may also 
contain Rev-independent variants which have been mutated so that the encoded 
protein loses a function that is unwanted in certain circumstances. In the latter case, 
for example, the gene may be modified to encode mutations (at the amino acid 
level) in the active site of reverse transcriptase or integrase proteins to prevent 
reverse transcription or integration. Rev-independent fragments of the gag gene are 
described in U.S. patent application Serial No. 07/858,747, filed March 27, 1992, 
and also in related U.S. Patent Nos. 5,972,596 and 5,965,726, which are 
incorporated by reference herein. 

In addition to being capable of producing HTV Gag and Pol proteins 
in the absence of Rev regulatory protein in a cell in vivo, the HIV gag/pol clone and 
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SIV gag clone of the invention are also capable of producing HIV Gag and Pol 
proteins in the absence of any added cis acting transport element, such as CTE or 
CTE-like elements (collectively refered herein as RNA Transport Elements (RTE)). 
Experiments indicate that the mutated vectors of the invention for SIV gag are fax 
superior to those adding CTE (see Qiu et al., J Virol. 73:9145-52 (1999)). 

The expression of the proteins encoded by these vectors after 
transfection into human cells may be monitored at both the level of RNA and 
protein production. RNA levels are quantitated by methods known in the art, e.g., 
Northern blots, SI mapping or PCR methods. Protein levels may also be 
quantitated by methods known in the art, e.g., western blot or ELISA or fluorescent 
detection methods. A fast non-radioactive ELISA protocol can be used to detect 
gag protein (DUPONT or COULTER gag antigen capture assay). 

At least three types of lentiviral vectors based on the gag/pol genes 
of the invention for use in gene therapy and/or as a vaccine are envisioned, i.e., 
lentiviral vectors having 

a) no round of replication (i.e., a zero replication system) 

b) one round of replication 

c) a fully replicating system 

For a system with no round of replication, a gag/pol gene, or separate 
gag and pol genes, or fragments of these genes, expressed using appropriate 
transcription units, e.g., a CMV promoter and a BGH poly (A) site. This will allow 
expression of the gag/pol unit (or gag or pol or fragment(s) thereof) for vaccine 
purposes. This expression can be accomplished without the production of any 
functional retroviral enzymes, provided that the appropriate mutation(s), e.g., a 
missense mutation, are introduced. In a zero replication system, a virus stock will 
be administered to the cells or animals of interest. For example, if one creates and 
uses a virus stock with the exemplified system using the packaging vector 
PCMVgagpolBNKan, the transfer construct pmBCwCNluci or pmBCmCNluci, and 
the envelope containing vector pHCMV-G, one obtains a zero replication system. 
The virus particles produced by such system can infect cells, and the reverse 
transcribed transfer construct DNA will go into the nucleus but, because the coding 
regions for viral structural proteins are not present, there will be no virus expression 
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and replication (0 rounds). If one transfects cells in vivo with the same 3 DNAs, 
they will go to the nucleus, express viral proteins, make infectious virus particles 
and go out and infect another cell or cells (1 round). Since in vivo delivery of three 
plasmids may result in lower expression, at least two different embodiments are 
envisioned. In the first, two plasmids may be used, e.g., MVl shown in Fig. 5 and 
an envelope expression plasmid such as pHCMV-G. Other plasmids encoding 
functional envelopes from HTV, SIV, or other retroviruses can also be used. 
Transfection by the two plasmids results in infectious virus that can infect and 
integrate into new cells (1 round). The infected cells produce gagpol but virus 
propagation is not possible in the absence of env. , 

For a system with one round of replication, at least two additional 
embodiments are envisioned. In the first method, a combination of the genes, e.g., a 
gag/pol gene, an env encoding gene and, preferably, a gene encoding a reporter 
protein or other polynucleotide or protein of interest, are delivered into the cells of 
interest in vivo. As discussed above for the exemplified system, if one transfects 
cells in vivo with the same 3 DNAs, they will go to the nucleus, express viral 
proteins, make infectious virus particles, be released and infect another cell or cells 
(1 round). 

In another embodiment, the same result (i.e., only one round of 
replication) can be obtained by using transfer vectors that have deletions in the 3' 
LTR and in which a heterologous-promoter (e.g., the CMV-promoter, or inducible 
promoter, or tissue-specific promoter), is used in place of the '3 'LTR promoter. 
The mutations in the 3' LTR making it inactive upon reverse transcription and 
integration. This is because the integrated provirus derives both its 5' LTR and its 3' 
LTR from the 3' LTR of the starting (transfer) construct. (This is a well-known 
property of all retroviruses and has been used to make self-inactivating vectors 
(SIN)). There are several reasons one may want to inactivate the incoming LTR 
promoter, one of which is to use a different tissue specific or regulated promoter for 
expression of a gene of interest in the integrated provirus. Note that, with SIN 
vectors, if one uses a viral stock made in vitro after transfection into cells and 
collection of infectious virus, there will be no round of replication. If one transfects 
cells with the DNAs in vivo, there will be one round of replication. If functional 
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gag, pol, or em are not included in the DN A mix, there will not be any infection at 
all (i.e., infectious viruses will not be made). 

A fully replicating Rev-independent system has not been constructed 
yet, although it is expected that a functional system can be constructed using Rev- 
independent gag/pol and em sequences. If desired, extra posttranscriptional control 
elements such as the CTE element, which can replace Rev and give infectious virus 
(see e.g., Zolotukhin et al., J.Virol.68:944-7952 (1994)) are included. The fully 
replicating system should be in one piece, containing the LTR, packaging signal, 
gag/pol, splice site, env, tat, one or more CTE or CTE-like elements (if desired for 
optimal results), and LTR. Tat is thought to be required in this construct, at least in 
non-permissive cells. Such a system is depicted in Figure 5, (construct MV2). In 
this system, a cell or animal of interest (preferably human) would be infected with 
virus stock that then propagates. CTE or CTE-like elements (depicted in construct 
MV2 as RTE (RNA Transport Elements)) are desirable since they have been shown 
to improve expression, and since many retroviruses require the presence of 
posttranscriptional control elements. There are several types of CTE and CTE-like 
elements, and these elements appear to work via a different pathway from the Rev- 
RRE pathway. See, e.g., Tabernero et al., J Virol. 71:95-101 (1997). See also, 
Pavlakis and Nappi, PCT/US99/1 1082, filed May 22, 1999, published as WO 
99/61596 on December 2, 1999 (and incorporated herein by reference), which 
describes a new type of post-transcriptional control element that is able to replace 
CTE and HIV RRE/Rev, The Pavlakis-Nappi element does not work in the same 
way as CTE and does not have any sequence or structure homology. 

In a preferred embodiment, a lentivifal system of the invention 
comprises the following three components: 

1 . a packaging vector containing nucleic acid sequences 

encoding the elements necessary for vector packaging such as 
structural proteins (except for HIV em) and the enzymes 
required to generate vector particles, the packaging vector 
comprising at least a mutated HIV or SIV gag/pol gene of the 
invention; 
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2. a transfer vector containing genetic cis-acting sequences 
necessary for the vector to infect the target cell and for 
transfer of the therapeutic or reporter or other gene(s) of 
interest, the transfer vector comprising the encapsidation 
signal and the gene(s) of interest or a cloning site for inserting 
the gene(s) of interest; 

and 

3. a vector containing sequences encoding an element necessary 
for targeting the viral particle to the intended recipient cell, 
preferably the gene encoding the G glycoprotein of the 
vesicular stomatis virus (VSV-G) or amphotrophic MuLV or 
lentiviral envs. 

Using the CMV promoter or other strong, high efficiency, promoter 
instead of the HIV-1 LTR promoter in the packaging vector, high expression of gag, 
pol, or gag/pol can be achieved in the total absence of any other viral protein. The 
exchange of the HTV-1 LTR promoter with other promoters is beneficial in the 
packaging vector or other vectors if constitutive expression is desirable and also for 
expression in other mammalian cells, such as mouse cells, in which the HIV-1 
promoter is weak. Vectors containing the sequences of the invention can be used 
for the Rev independent production of HIV-1 Gag/Pol, HTV-1 Gag, HIV-1 Pol, and 
SIV Gag proteins. In certain embodiments, the presence of heterologous promoters 
will also be desired in the transfer vector and the envelope encoding vector, when 
such vectors are used. 

The gene(s) of interest are chosen according to the effect sought to be 
achieved. For gene therapy purposes there will be at least one therapeutic gene 
encoding a gene product which is active against the condition it is desired to treat or 
prevent. Alternatively or additionally, there may be a gene which acts as a marker 
by encoding a detectable product. Therapeutic genes may encode, for example, an 
anti-sense RNA, a ribozyme, a transdominant negative mutant of a target protein, a 
toxin, a conditional toxin, an antigen that induces antibodies or helper T-cells or 
cytotoxic T-cells, a single chain antibody or a tumor suppresser protein. See, e.g., 
WO 98/17816. 
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An even more extensive list of genes of interest for use in lentiviral 
vectors is described, e.g., in WO 99/04026 on page 10, line 20 to page 12, line 7. 
Table 2 of Klimatcheva et al. (1999) also provides a list of disorders and target cells 
for gene therapy, as well as a number of lentiviral vectors used by others. This list 
includes genetic/metabolic deficiencies, viral infection and cancer. Inherited 
genetic defects such as adenosine deaminase deficiency, familial 
hypercholesterolemia, cystic fibrosis, mucopolysaccharidosis type VII, types I and 
II diabetes, classical phenylketonuria and Gaucher disease are diseases which are 
listed as being possible to overcome by lentiviral vector-mediated gene therapy 
because they constitute single-gene deficiencies for which the involved genes are 
known. Viral diseases are also listed as constituting appropriate targets for lentiviral 
gene delivery. In particular, a number of gene therapy approaches have been 
proposed for the treatment of HIV infection and, for some of these strategies, phase 
I studies have recently begun in humans. The article states that preliminary studies 
have dealt with defective murine oncoviruses for delivery of anti-sense RNAs, 
ribozymes and trans-dominant proteins against HTV replication. 

In any of the vectors, but preferably in the transfer vector, an inserted 
gene could have an internal ribosomal entry site (IRES), e.g., from picornaviral 
RNA. An IRES will be used in circumstances that one wants to express two 
proteins from the same promoter. For example one protein of interest and a marker 
gene, e.g., green fluorescent protein (GFP) or a marker gene and a drug resistance 
gene (e.g. the firefly luciferase gene and neomycin phosphotransferase gene) as 
described on p. 58 of WO 99/04026, for example. Using an IRES the expression of 
the two proteins is coordinated. A further gene or genes may also be present under 
the control of a separate promoter. Such a gene may encode for example a 
selectable marker, or a further therapeutic agent which may be among the 
therapeutic agents listed above. Expression of this gene may be constitutive; in the 
case of a selectable marker this may be useful for selecting successfully transfected 
packaging cells, or packaging cells which are producing particularly high titers of 
the retroviral vector particles. Alternatively or additionally, the selectable marker 
may be useful for selecting cells which have been successfully infected with the 
lentiviral vector and have the provirus integrated into their own genome. 
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One way of performing gene therapy is to extract cells from a 
patient, infect the extracted cells with a lentiviral vector and reintroduce the cells 
back into the patient A selectable marker may be used to provide a means for 
enriching for infected or transduced cells or positively selecting for only those cells 
which have been infected or transduced, before reintroducing the cells into the 
patient. This procedure may increase the chances of success of the therapy. 
Selectable markers may be for instance drug resistance genes, metabolic enzyme 
genes, or any other selectable markers known in the art. Typical selection genes 
encode proteins that confer resistance to antibiotics and other toxic substances, e.g., 
histidinol, puromycin, hygromycin, neomycin, methotrexate etc. and cell surface 
markers. 

However, it will be evident that for many gene therapy applications 
of lentiviral vectors, selection for expression of a marker gene may not be possible 
or necessary. Indeed expression of a selection marker, while convenient for in vitro 
studies, could be deleterious in vivo because of the inappropriate induction of 
cytotoxic T lymphocytes (CTLs) directed against the foreign marker protein. Also, 
it is possible that for in vivo applications, vectors without any internal promoters 
will be preferable. The presence of internal promoters can affect for example the 
transduction titres obtainable from a packaging cell line and the stability of the 
integrated vector. Thus, single transcription unit vectors, which may be bi-cistronic 
or poly-cistronic, coding for one or two or more therapeutic genes, may be the 
preferred vector designed for use in vivo. See, e.g., WO 98/17816. 

Suitable host or producer cells for use in the invention are well 
known in the art. May lentiviruses have already been split into replication defective 
genomes and packaging components. For those which have not the technology is 
available for doing so. The producer cell encodes the viral components not encoded 
by the vector genome such as the Gag, Pol and Env proteins. The gag,pol and env 
genes may be introduced into the producer cell transiently, or may be stably 
integrated into the cell genome to give a packaging cell line. The lentiviral vector 
genome is then introduced into the packaging cell line by transfection or 
transduction to create a stable cell line that has all of the DNA sequences required to 
produce a lentiviral vector particle. Another approach is to introduce the different . 
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DNA sequences that are required to produce lentiviral vector particle, e.g., the env 
coding constrict, the gag-pol coding construct and the transfer construct into the cell 
simultaneously by transient triple transfection. 

Target cells identified by Klimatcheva et al. (1999), and the 
references cited therein, include airway epithelial cells for cystic fibrosis; retinal 
photoreceptor cells for retinitis pigmentosa; progenitors for red blood cells, 
macrophages, and lymphocytes for hematopoietic disorders, sickle cell anemia, B- 
thalassemia, lysosomal storage disorders, mucopolysaccharidoses, and severe 
combined immunodeficiency syndrome; bone marrow cells and macrophages for 
Gaucher's disease; liver cells for familial hypercholesterolaemia; T-lymphocytes 
and macrophages for HIV infection; brain tissue, neurons, and glial cells for 
neurodegenerative diseases such as Parkinson's and Alzheimer's diseases; 
endothelial cells and cardiac myocytes for cardiovascular diseases; and cancer cells 
in various tissues (e.g. liver or brain) for cancer. Target cells for other diseases 
would be apparent to one of skill in the art. 

Vaccines and pharmaceutical compositions comprising at least one of 
the nucleic acid sequences, vectors, vector systems, or transduced or transfected 
host cells of the invention and a physiologically acceptable carrier are also part of 
the invention. 

As used herein, the term 'transduction" generally refers to the 
transfer of genetic material into the host via infection, e.g., in this case by the 
lentiviral vector. The term "transfection" generally refers to the transfer of isolated 
genetic material into cells via the use of specific transfection agents (e.g., calcium 
phosphate, DEAE Dextran, lipid formulations, gold particles, and other 
microparticles) that cross the cytoplasmic membrane and deliver some of the 
genetic material into the cell nucleus. 

Systems similar to those described herein can be produced using 
elements of lenti viruses in addition to the HTV and/or SIV genes described herein. 

Pharmaceutical Compositions 

The pharmaceutical compositions of the invention contain a 
pharmaceutical^ and/or therapeutically effective amount of at least one nucleic acid 
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construct, vector, vector system, viral particle/virus stock, or host cell (i.e., agents) 
of the invention. In one embodiment of the invention, the effective amount of an 
agent of the invention per unit dose is an amount sufficient to cause the detectable 
expression of the gene of interest. In another embodiment of the invention, the 
effective amount of agent per unit dose is an amount sufficient to prevent, treat or 
protect against deleterious effects (including severity, duration, or extent of 
symptoms) of the condition being treated. The effective amount of agent per unit 
dose depends, among other things, on the species of mammal inoculated, the body 
weight of the mammal and the chosen inoculation regimen, as is well known in the 
art. The dosage of the therapeutic agents which will be most suitable for 
prophylaxis or treatment will also vary with the form of administration, the 
particular agent chosen and the physiological characteristics of the particular patient 
under treatment. The dose is administered at least once. Subsequent doses may be 
administered as indicated. 

To monitor the response of individuals administered the 
compositions of the invention, mRNA or protein expression levels may be 
determined. In many instances it will be sufficient to assess the expression level in 
serum or plasma obtained from such an individual. Decisions as to whether to 
administer another dose or to change the amount of the composition administered to 
the individual may be at least partially based on the expression levels. 

The term "unit dose" as it pertains to the inocula refers to physically 
discrete units suitable as unitary dosages for mammals, each unit containing a 
predetermined quantity of active material (e.g., nucleic acid, virus stock or host cell) 
calculated to produce the desired effect in association with the required diluent. The 
titers of the virus stocks to be administered to a cell or animal will depend on the 
application and on type of delivery (e.g., in vivo or ex vivo). The virus stocks can be 
concentrated using methods such as centrifugation. The titers to be administered ex 
vivo are preferably in the range of 0.001 to 1 infectious unit /cell. Another method 
of generating viral stocks is to cocultivate stable cell lines expressing the virus with 
the target cells. This method has been used to achieve better results when using 
traditional retroviral vectors because the cells can be infected over a longer period 
of time and they have the chance to be infected with multiple copies of the vector. 
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For in vivo administration of nucleic acid constructs, vectors, vector 
systems, virus stocks, or cells which have been transduced or transfected ex v/vo, the 
dose is to be determined by dose escalation, with the upper dose being limited by 
the onset of unacceptable adverse effects. Preliminary starting doses may be 
extrapolated from experiments using lentiviral vectors in animal models, by 
methods known in the art, or may be extrapolated from comparisons with known 
retroviral (e.g., adenoviral) doses. Generally, small dosages will be used initially 
and, if necessary, will be increased by small increments until the optimum effect 
under the circumstances is reached. Exemplary dosages are within the range of 10 8 
up to approximately 5 x 10 15 particles. 

Inocula are typically prepared as a solution in a physiologically 
acceptable carrier such as saline, phosphate-buffered saline and the like to form an 
aqueous pharmaceutical composition. 

The agents of the invention are generally administered with a 
physiologically acceptable carrier or vehicle therefor. A physiologically acceptable 
carrier is one that does not cause an adverse physical reaction upon administration 
and one in which the nucleic acids are sufficiently soluble to retain their activity to 
deliver a pharmaceutically or therapeutically effective amount of the compound. 
The pharmaceutically or therapeutically effective amount and method of 
administration of an agent of the invention may vary based on the individual patient, 
the indication being treated and other criteria evident to one of ordinary skill in the 
art. A therapeutically effective amount of a nucleic acid of the invention is one 
sufficient to prevent, or attenuate the severity, extent or duration of the deleterious 
effects of the condition being treated without causing significant adverse side 
effects. The route(s) of administration useful in a particular application are apparent 
to one or ordinary skill in the art. 

Routes of administration of the agents of the invention include, but 
are not limited to, parenteral, and direct injection into an affected site. Parenteral 
routes of administration include but are not limited to intravenous, intramuscular, 
intraperitoneal and subcutaneous. The route of administration of the agents of the 
invention is typically parenteral and is preferably into the bone marrow, into the 
CSF intramuscular, subcutaneous, intradermal, intraocular, intracranial, intranasal, 
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and the like. See, e.g., WO 99/04026 for examples of formulations and routes of 
administration. 

The present invention includes compositions of the agents described 
above, suitable for parenteral administration including, but not limited to, 
pharmaceutically acceptable sterile isotonic solutions. Such solutions include, but 
are not limited to, saline and phosphate buffered saline for nasal, intravenous, 
intramuscular, intraperitoneal, subcutaneous or direct injection into a joint or other 
area. 

In providing the agents of the present invention to a recipient 
mammal, preferably a human, the dosage administered will vary depending upon 
such factors as the mammal's age, weight, height, sex, general medical condition, 
previous medical history and the like. 

The administration of the pharmaceutical compositions of the 
invention may be for either "prophylactic" or "therapeutic" purpose. When 
provided prophylactically, the compositions are provided in advance of any 
symptom. The prophylactic administration of the composition serves to prevent or 
ameliorate any subsequent deleterious effects (including severity, duration, or extent 
of symptoms) of the condition being treated. When provided therapeutically, the 
composition is provided at (or shortly after) the onset of a symptom of the condition 
being treated. 

For all therapeutic, prophylactic and diagnostic uses, one or more of 
the agents of the invention, as well as antibodies and other necessary reagents and 
appropriate devices and accessories, may be provided in kit form so as to be readily 
available and easily Used. 

Where immunoassays are involved, such kits may contain a solid 
support, such as a membrane (e.g., nitrocellulose), a bead, sphere, test tube, rod, and 
so forth, to which a receptor such as an antibody specific for the target molecule will 
bind. Such kits can also include a second receptor, such as a labeled antibody. Such 
kits can be used for sandwich assays to detect toxins. Kits for competitive assays 
are also envisioned. 
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VI. INDUSTRIAL APPLICABILITY 

Mutated genes of this invention can be expressed in the native host 
cell or organism or in a different cell or organism. The mutated genes can be 
introduced into a vector such as a plasmid, cosmid, phage, virus or mini- 
chromosome and inserted into a host cell or organism by methods well known in the 
art. In general, the mutated genes or constructs containing these mutated genes can 
be utilized in any cell, either eukaryotic or prokaryotic, including mammalian cells 
(e.g., human (e.g., HeLa), monkey (e.g., Cos), rabbit (e.g., rabbit reticulocytes), rat, 
hamster (e.g., CHO and baby hamster kidney cells) or mouse cells (e.g., L cells), 
plant cells, yeast cells, insect cells or bacterial cells (e.g., R coli). The vectors 
which can be utilized to clone and/or express these mutated genes are the vectors 
which are capable of replicating and/or expressing the mutated genes in the host cell 
in which the mutated genes are desired to be replicated and/or expressed. See, e.g., 
F. Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing 
Associates and Wiley-Interscience (1992) and Sambrook et al. (1989) for examples 
of appropriate vectors for various types of host cells. The native promoters for such 
genes can be replaced with strong promoters compatible with the host into which 
the gene is inserted. These promoters may be inducible. The host cells containing 
these mutated genes can be used to express large amounts of the protein useful in 
enzyme preparations, pharmaceuticals, diagnostic reagents, vaccines and 
therapeutics. 

Mutated genes or constructs containing the mutated genes may also 
be used for in-vivo or in-vitro gene therapy. For example, a mutated gene of the 
invention will produce an mRNA in situ to ultimately increase the amount of protein 
expressed. Such gene include viral genes and/or cellular genes. Such a mutated 
gene is expected to be useful, for example, in the development of a vaccine and/or 
genetic therapy. 

The constructs and/or proteins made by using constructs encoding 
the mutated gag, env, and pol genes could be used, for example, in the production of 
diagnostic reagents, vaccines and therapies for AIDS and AIDS related diseases. 
The inhibitory/instability elements in the HTV-1 gag gene may be involved in the 
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establishment of a state of low virus production in the host HIV-1 and the other 
lentiviruses cause chronic active infections that are not cleared by the immune 
system. It is possible that complete removal of the inhibitory/instability sequence 
elements from the lentiviral genome would result in constitutive expression. This 
could prevent the virus from establishing a latent infection and escaping immune 
system surveillance. The success in increasing expression of the entire gag/pol gene 
by eliminating the inhibitory sequence element suggests that one could produce 
lentiviruses without any negative elements. Such lentiviruses could provide a novel 
approach towards attenuated vaccines. 

For example, vectors expressing high levels of Gag can be used in 
immunotherapy and immunoprophylaxis, after expression in humans. Such vectors 
include retroviral vectors and also include direct injection of DNA into muscle cells 
or other receptive cells, resulting in the efficient expression of gag, using the 
technology described, for example, in Wolff et al., Science 247:1465-1468 (1990), 
Wolff et al., Human Molecular Genetics l(6):363-369 (1992) and Ulmer et al., 
Science 259:1745-1749 (1993). Further, the gag constructs could be used in 
transdominant inhibition of HIV expression after the introduction into humans. For 
this application, for example, appropriate vectors or DNA molecules expressing 
high levels of p55 gag or p37 gag would be modified to generate transdominant gag 
mutants, as described, for example, in Trono et al., Cell 59:113-120 (1989). The 
vectors would be introduced into humans, resulting in the inhibition of HIV 
production due to the combined mechanisms of gag transdominant inhibition and of 
immunostimulation by the produced gag protein. In addition, the gag constructs of 
the invention could be used in the generation of new retroviral vectors based on the 
expression of lentiviral gag proteins. Lentiviruses have unique characteristics that 
may allow the targeting and efficient infection of non-dividing cells. Similar 
applications are expected for vectors expressing high levels of env. 

Identification of similar inhibitory/instability elements in SIV 
indicates that this virus is a convenient model to test these hypotheses. SIV 
similarly modified could be used in place of HIV in an effort to further minimize the 
possibility of rearrangement events that would lead to the generation of infectious 
HIV. 
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The following examples illustrate certain embodiments of the present 
invention, but should not be construed as limiting its scope in any way. Certain 
modifications and variations will be apparent to those skilled in the art from the 
teachings of the foregoing disclosure and the following examples, and these are 
intended to be encompassed by the spirit and scope of the invention. 

EXAMPLE 1 
Rev-Independent HTV-1 Gag/Pol Molecular Clone 

Figure 1 shows the DNA sequence of a Rev-independent HIV-1 
gag/pol molecular clone. This DNA sequence shown encodes the complete Gag and 
Pol of HTV-1 and can be expressed in a Rev-independent manner when operably 
linked to a promoter. The Rev-independent gag sequence was described in U.S. 
Patent Nos. 5,972,596 and 5,965,726 and the Rev-independent pol sequence was 
generated by eliminating the inhibitory/instability sequences using the methods 
described in U.S. Patent Nos. 5,972,596 and 5,965,726. Others have reportedly 
made Rev independent gag sequences by optimizing codon usage for human cells 
(see, e.g, WO 98/34640). 

Figure 2 shows an alignment of the sequence of the wild - type and 
mutated pol region in pCMVgagpolBNkan. Position #1 in the figure is position 
2641 in plasmid pCMVgagpolBNkan. 

The elimination of INS in gag, pol and env regions allows the 
expression of high levels of authentic HTV-1 structural proteins in the absence of the 
Rev regulatory factor of HTV-1. 

EXAMPLE 2 

Rev-Independent SIV Gag Molecular Clone 

Figure 3 shows the DNA sequence of a Rev-independent SIV gag 
molecular clone, STVgagDX. Figure 4 shows the comparison of wild type (WT) 
and mutant (STVgagDX) sequences. The wild type STV sequence is from Simian 
(macaque) immunodeficiency virus isolate 239, clone lambda siv 239-1 (GenBank 
accession No. M33262). 
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EXAMPLE 3 
Rev-Independent SIV Env Molecular Clone 

Figure 16 shows a schematic diagram, and figure 17 shows the DNA 
sequence, of the "env-coding" vector CMVkan/R-R-SIVenvCTE, which is an 
example of a vector comprising a mutated lentiviral env gene sequence which is 
capable of being expressed independently of any SIV or HTV regulatory factors. 
"CMV" denotes the cytomegalovirus promoter; "SRV-CTE" denotes the 
constitutive transport element (CTE) of Simian Retrovirus Type 1 ; "all-STOP" 
denotes a sequence providing translational stops in all three reading frames; "BGH 
terminator" denotes the bovine growth hormone polyadenylation signal. Other 
posttranscriptional control elements can be used instead of the indicated SRV-CTE, 
for example the one described by Pavlakis and Nappi, PCT/US99/1 1082, filed May 
22, 1999, which was published as WO 99/61596 on December 2, 1999 (and which 
is incorporated herein by reference). 

As mentioned previously above, such a vector encoding a lentiviral 
env gene may be used if it is desired that the vector infect CD4 + T cells. Also as 
mentioned previously above, the CTE element (i.e., the SRV-CTE element in the 
case of vector CMVkan/R-R-SIVenvCTE), can be replaced with another post- 
transcriptional control element, such as the Pavlakis-Nappi element, that is able to 
replace CTE and HIV RRE/Rev. See Pavlakis and Nappi, PCT/US99/1 1 082, filed 
May 22, 1999, which was published as WO 99/61596 on December 2, 1999 (and 
which is incorporated herein by reference). 

EXAMPLE 4 

Lentivirial Vector System 

Figure 5 is a schematic of some of the components of a preliminary 
version of the Rev-independent lentiviral vector system exemplified herein, 
including a packaging construct and three different transfer vectors which may be 
used. In the lentiviral system exemplified herein, the packaging construct also 
contains the gene for kanamycin resistance. The lentiviral system exemplified 
herein also contains the vector pHCMV-G, which is shown in Figure 5. 
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In the packaging construct shown in Figure 5, "CMV" denotes the 
cytomegalovirus promoter, "Gag" denotes the gag gene, which generates 
components of the virion core, "Pro" denotes "protease" "RT" denotes "reverse 
transcriptase," s Int" denotes "integrase" and "BGH poly (A)" denotes the bovine 
growth hormone polyadenylation signal. The protease, reverse transcriptase, and 
integrase genes comprise the "pol" gene. In transfer construct 1, "LTR" denotes the 
HIV "long terminal repeat", which contains a HIV promoter; "mSD" denotes 
"mutated splice donor site," which is present in the construct so that splicing of the 
RNA transcript does not occur; "vj;" denotes the encapsidation signal; "wGA" 
denotes part of the wild-type gag gene which contains sequences believed to be 
necessary for encapsidation; "X" indicates that the ATG codon of the partial gag 
gene sequence is mutated so that translation of this gene does not occur; "CMV" 
denotes the cytomegalovirus promoter and luciferase is used as a reporter gene. 
Luciferase can be replaced with any gene of interest. Another HIV LTR is present 
at the 3 ' end of transfer construct 1 . Replacement of this LTR in constructs such as 
the transfer construct 1, 2, or 3 with a promoter-enhancer deleted HIV LTR leads to 
inactivation of LTR after integration. Transfer construct 2 is similar to transfer 
construct 1 , the difference being that a mutated part of the gag gene (denoted 
"mGa") is used instead of the wild-type part of the gag gene. Transfer construct 3 
(pm2BCwCNluci) has different mutations at the 5' splice site and has an intact ATG 
codon so that translation of part of the mutated gag gene occurs. Transfer construct 
3 also has a 5' CMV promoter instead of a 5' LTR promoter. This construct is 
expressed independent of the presence of HIV Tat protein. The transfer constructs 
expressed from the LTR promoter are partially dependent on Tat protein. In 293 
cells significant expression can be achieved in the absence of Tat. See, e.g., 
Valentin et al., Proc. Natl Acad. Sci. USA. 95:8886-91 (1988). 

EXAMPLE 5 

Generation of Packaging Construct pCMVgagpol BNkan 

Figure 6 shows a schematic map of the packaging construct pCMV 
gagpolBNKan. The nucleotide numbering is that of the HXB2R sequence 
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(Genbank accession number K03455 and M38432), where +1 is the start of 
transcription. 

The sequence in HTV-1 gag/pol region was mutated in order to 
eliminate all the INS. The fragment from the beginning of gag to BsrGI site in pol, 
and the fragment KE [Kpnl(3700)- EcoRI(4194)] were previously mutated 
described in Schneider et al., J Virol. 71 : 4892-4903 (1997) and in U.S. Patent Nos. 
5,972,596 and 5,965,726. 

To generate pCMVgagpolBNkan, three fragments within HTV-1 pol 
region were mutated. They are fragment BP [BsrGI(2207)-PflMI(3032)], fragment 
PK [PflMI(3032)-KpnI(3700)] and fragment EN [EcoRI(4194)-NdeI(4668)]. 
Mutagenesis was performed using a modified version of the method described by 
Ho et al., Gene 77: 51-59 (1989) and DNA shuffling (Zhao and Arnold, Nucl. Acid 
Res. 25(6), 1307-1308 (1997). Sixteen oligonucleotides extending over the 
complete sequence of the three fragments were designed. Six oligos corresponded 
to fragment BP, six to fragment PK, and four to fragment EN (the oligonucleotides 
ranged from 130 to 195 bases in length; adjacent oligos overlapped by twenty 
nucleotides). Each fragment was assembled in two steps: 

1) PCR; the reaction was carried out in standard pfu buffer with 
10 pmol of each purified big oligo, 0.2 mM of each dNTPs and 2.5 u pfu DNA 
polymerase enzyme (Stratagene) in a 50 (il final volume. The PCR program was: 3 
min 96°C followed by 50 cycles of 1 min 94°C, 1 min 55°C, and 1 min + 5 s/cycle 
72°C, ended by 7 min at 72°C. After PCR, the big oligonucleotides were removed 
from the assembled mutated fragment. 

2) The second step was to specifically amplify the assembled 
products with 30 mer primers located at the 5 1 and 3' end of each mutated fragment. 
One microliter of the assembled PCR product was used as template in a 25-cycle 
PCR reaction with 50 pmol of each primer, 1 x pfu buffer, 0.2 mM of each dNTP 
and 2.5 u pfu DNA polymerase in a 50 \jA final volume. The PCR program was: 3 
min 96°C, 10 cycles of 30 s 94°C, 30 s 55°C, 45 s 72°C, followed by another 14 
cycles of 30 s 94°C, 30 s 55°C, 45 s + 20 s/cycle 72°C, and finally 7 min 72°C. 
This program gave a single PCR product of the correct size. The amplified BP, PK 
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and EN fragments were individually cloned into PCR-script™ vector using PCR- 
script™ Amp SK(+) Cloning Kit (Stratagene). Clones were randomly selected and 
sequenced. The correct BP, PK and EN fragments together with fragment KE 
previously mutated by Schneider et al. were ligated between BsrGI and Kpnl site of 
p55AMl-R5 (which was previously described in Schneider et ah, J. Virol. 71: 4892- 
4903 (1 997)) to produce a completely mutated gagpol ORF. The new plasmid 
containing the completely mutated gag/pol was named pLTRgagpolBN. BN stands 
for the modification of the fragment between BsrGI and Ndel. The mutated gag/pol 
was then cloned into a CMVkan vector containing the cytomegalovirus major late 
promoter (GenBank accession no. XI 7403) and the kanamycin resistance gene, 
resulting in pCMVgagpolBNkan. The plasmid backbone comes from pVR1332 
provided by Vical Inc., and described in Hartikka et al., Hum Gene Ther. 7:1205-17 
(1996). 

It is understood that different plasmid backbones can be used, e.g., to 
provide good expression in vivo, in the case of DNA injection, for example. 

EXAMPLE 6 

Construction of Transfer Vectors pmBCwCNluci and pmBCmCNluci 

The HTV-1 sequence BC, between BssHII (257) and Clal (376), 
contains the major splice donor site and the encapsidation signal. Six oligos (33 to 
46 bases) were designed to introduce mutations on the splice donor site and the 
AUG start codon of gag. The BC fragment was assembled, amplified and 
sequenced as described in the section concerning the construction of 
pCMVgagpolBN. 

The mutated BC fragment and a fragment of wild type gag between 
Clal (376) and Nsi (793) were placed between the BssHII and Nsi sites of p55RRE 
(Schneider et al, J. Virol. 71:4892-4903 (1997)) to generate pmBCwCN. In 
parallel, the fragment between Clal (376) and Nsil sites of mutated gag from 
p55BMl-10SD+ was used to generate pmBCmCN. (p55BMl-10SD+ is similar to 
p55BMl-10, which is described in Schneider et al. (1997), but contains in addition 
the intact splice donor and encapsidation site upstream of gag). The region between 
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Nsil and Xhol containing 3' part of gag and RRE in pmBCwCN and pmBCraCN 
was replaced by a Clal-Xhol fragment containing CMV promoter and luciferase 
gene from pHR'-CMVluci (vector from D. Trono) to generate pmBCwCNluci and 
pmBCmCNluci (which are shown as transfer constructs 1 and 2 in Figure 5, and 
schematically depicted in Figures 7 and 8, respectively). The sequences of these 
plasmids are shown in Figures 10 and 11, respectively. Different versions of these 
plasmids have also been created, by standard procedures, with variations in the 
region of the encapsidation site, the first splice donor site, and the initiator gag 
AUG. For example, the transfer construct pm2BcwCNluci (which is shown as 
transfer construct 3 in Fig. 5) has different mutations in the 5' splice site region and 
has an intact ATG. A comparison of the sequences in the BssHII-Cla I region of 
transfer constructs 1 and 2 (mBCwCN frag), transfer construct 3 (m2BCwCN frag), 
HXB2 and NL43 is shown in Fig. 12. 

EXAMPLE 7 
Preparation of Viral Particles 

Lentiviral particles were generated by transient cotransfection of 293 
human kidney cells with a combination of three plasmids: pCMVgagpolBNkan, 
pmBCwCNluci or pmBCmCNluci (transfer vector) and pHCMV-G (Yee et al., 
Proc. Natl. Acad. Sci., USA, 91 :9564-9568 (1994) a plasmid coding for the 
envelope VSV-G (glycoprotein of vesicular stomatitis virus). 

The day before the transfection, 293 cells were plated at a density of 
TO 6 cells/plate on a 60 mm plate. Plasmid DNA was transfected by the Ca-phosphate 
precipitation method in the following proportions: 3 \ig packaging construct, 6 ^g 
transfer construct and 100 ng VSV-G encoding construct, pHCMV-G. [Note that 
the LTR promoter can be expressed in 293 cells in the absence of Tat with a 
moderate decrease in efficiency. The transfer constructs can be fully Tat 
independent after replacement of the LTR promoter with a CMV (see, e.g., transfer 
construct 3 in Fig. 5) or other promoter in such a way that the mRNA start site is at 
the beginning of the LTR R region.] In the present experiments for preparation of 
viral particles 500 ng of a tat expression plasmid was included in the transfection. 
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Cells were washed the day after transfection and were kept in 
DMEM medium for another 48 hours before the supernatants were harvested. 
Supematants were spun at 1,200 rpm for 7 mins to eliminate any floating cells. 
pCMVgagpolBNkan produces high levels of Gag protein that is efficiently released 
from the cells (Figure 13), and also produces high levels of functional Pol as judged 
by levels of reverse transcriptase activity similar to those found upon expression of 
complete HIV- 1 (Figure 14). 

Supernatants from 293 transfected cells were used to transduce 
several human cell lines (293, Jurkat, U937) and non-dividing human primary 
macrophages. 

EXAMPLES 

Cell Transduction 

Transduction was performed by incubating for 3-4 hours at 37°C the 
target cells with 1-2 ml of supernatant containing the retroviral vectors. The amount 
of retroviral vector present in the supernatant was normalized by p24 content 
(measured by ELISA). Equal amounts of p24 gag protein were used for infection of 
cells. This way, differences in production of the different preparations was 
minimized. 

The macrophages used for transduction were isolated from the 
peripheral blood of healthy donors by adherence to plastic. Cells were cultured in 
RPMI + 20% fetal calf serum (FCS) + 10% human serum (HS). After 1 week, non- 
adherent cells were washed off with PBS and the macrophages were kept in culture 
for another 1-2 weeks in the absence of human serum. The cells were washed 2-4 
times with PBS before transduction. 

Cells were harvested 48 hours after transduction (seven days for 
primary macrophages) and the transduction efficiency was determined by measuring 
luciferase activity in cell extracts from the cultures. The results of the transduction 
experiments in 293 Jurkat, U937 and primary macrophages are shown in Figure 
15A-D. These results demonstrate that Rev-independent gag-HIV-1 based 
retroviral vectors display high transduction efficiency in (A) 293 cells, (B) human 
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lymphoid cells, (C) human myeloid cells (U937), as well as (D) non-dividing cells 
such as primary human macrophages. 

EXAMPLE 9 

Use Of Nucleic Acids of the Invention 
In Immunoprophvlaxis Or Immunotherapy 

In postnatal gene therapy, new genetic information has been 
introduced into tissues by indirect means such as removing target cells from the 
body, infecting them with viral vectors carrying the new genetic information, and 
then reimplanting them into the body; or by direct means such as encapsulating 
formulations of DNA hi liposomes; entrapping DNA in proteoliposomes containing 
viral envelope receptor proteins; calcium phosphate co-precipitating DNA; and 
coupling DNA to a polylysine-glycoprotein carrier complex. In addition, in vivo 
infectivity of cloned viral DNA sequences after direct intrahepatic injection with or 
without formation of calcium phosphate coprecipitates has also been described. 
mRNA sequences containing elements that enhance stability have also been shown 
to be efficiently translated in Xenopus laevis embryos, with the use of cationic lipid 
vesicles. See, e.g., J. A. Wolff, et al., Science 247:1465-1468 (1990) and references 
cited therein. 

Recently, it has also been shown that injection of pure RNA or DNA 
directly into skeletal muscle results in significant expression of genes within the 
muscle cells. J.A. Wolff, et al., Science 247:1465-1468 (1990). Forcing RNA or 
DNA introduced into muscle cells by other means such as by particle-acceleration 
(N. -S. Yang, et al. Proc. Natl. Acad. Sci. USA 87:9568-9572 (1990); S.R. Williams 
et al., Proc. Natl. Acad. Sci. USA 88:2726-2730 (1991)) or by viral transduction 
should also allow the DNA or RNA to be stably maintained and expressed. In the 
experiments reported in Wolff et al., RNA or DNA vectors were used to express 
reporter genes in mouse skeletal muscle cells, specifically cells of the quadriceps 
muscles. Protein expression was readily detected and no special delivery system 
was required for these effects. Polynucleotide expression was also obtained when 
the composition and volume of the injection fluid and the method of injection were 
modified from the described protocol. For example, reporter enzyme activity was 
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reported to have been observed with 10 to 100 \xl of hypotonic, isotonic, and 
hypertonic sucrose solutions, Opti-MEM, or sucrose solutions containing 2mM 
CaCk and also to have been observed when the 10- to 100- jol injections were 
performed over 20 min. with a pump instead of within 1 nun. 

Enzymatic activity from the protein encoded by the reporter gene 
was also detected in abdominal muscle injected with the RNA or DNA vectors, 
indicating that other muscles can take up and express polynucleotides. Low 
amounts of reporter enzyme were also detected in other tissues (liver, spleen, skin, 
lung, brain and blood) injected with the RNA and DNA vectors. Intramuscularly 
injected plasmid DNA has also been demonstrated to be stably expressed in non- 
human primate muscle. S. Jiao et al., Hum. Gene Therapy 3:21-33 (1992). 

It has been proposed that the direct transfer of genes into human 
muscle in situ may have several potential clinical applications. Muscle is 
potentially a suitable tissue for the heterologous expression of a transgene that 
would modify disease states in which muscle is not primarily involved, in addition 
to those in which it is. For example, muscle tissue could be used for the 
heterologous expression of proteins that can immunize, be secreted in the blood, or 
clear a circulating toxic metabolite. The use of RNA and a tissue that can be 
repetitively accessed might be useful for a reversible type of gene transfer, 
administered much like conventional pharmaceutical treatments. See J. A. Wolff, et 
al., Science 247:1465-1468 (1990) and S. Jiao et al., Hum. Gene Therapy 3:21-33 
(1992). 

It had been proposed by J.A. Wolff et al., supra , that the intracellular 
expression of genes encoding antigens might provide alternative approaches to 
vaccine development. This hypothesis has been supported by a recent report that 
plasmid DNA encoding influenza A nucleoprotein injected into the quadriceps of 
BALB/c mice resulted in the generation of influenza A nucleoprotein-specific 
cytotoxic T lymphocytes (CTLs) and protection from a subsequent challenge with a 
heterologous strain of influenza A virus, as measured by decreased viral lung titers, 
inhibition of mass loss, and increased survival. J. B. Ulmer et al., Science 
259:1745-1749(1993). 
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Therefore, it appears that the direct injection of RNA or DNA vectors 
encoding the viral antigen can be used for endogenous expression of the antigen to 
generate the viral antigen for presentation to the immune system without the need 
for self-replicating agents or adjuvants, resulting in the generation of antigen- 
specific CTLs and protection from a subsequent challenge with a homologous or 
heterologous strain of virus. 

CTLs in both mice and humans are capable of recognizing epitopes 
derived from conserved internal viral proteins and are thought to be important in the 
immune response against viruses. By recognition of epitopes from conserved viral 
proteins, CTLs may provide cross-strain protection. CTLs specific for conserved 
viral antigens can respond to different strains of virus, in contrast to antibodies, 
which are generally strain-specific. 

Thus, direct injection of RNA or DNA encoding the viral antigen has 
the advantage of being without some of the limitations of direct peptide delivery or 
viral vectors. See J.A. Ulmer et al., supra, and the discussions and references 
therein). Furthermore, the generation of high-titer antibodies to expressed proteins 
after injection of DNA indicates that this may be a facile and effective means of 
making antibody-based vaccines targeted towards conserved or non-conserved 
antigens, either separately or in combination with CTL vaccines targeted towards 
conserved antigens. These may also be used with traditional peptide vaccines, for 
the generation of combination vaccines. Furthermore, because protein expression is 
maintained after DNA injection, the persistence of B and T cell memory may be 
enhanced, thereby engendering long-lived humoral and cell-mediated immunity. 

1. Vectors for the immunoprophylaxis or 
immunotherapy against HIV-1 

The mutated gag>pol or gag/pol sequences will be inserted in 
expression vectors using a strong constitutive promoter such as CMV or RS V, or an 
inducible promoter such as HTV-1. 

The vector will be introduced into animals or humans in a 
pharmaceutically acceptable carrier using one of several techniques such as 
injection of DNA directly into human tissues; electroporation or transfection of the 
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DNA into primary human cells in culture (ex vivo), selection of cells for desired 
properties and reintroduction of such cells into the body, (said selection can be for 
the successful homologous recombination of the incoming DNA to an appropriate 
preselected genomic region); generation of infectious particles containing the gag 
gene, infection of cells ex vivo and reintroduction of such cells into the body; or 
direct infection by said particles in vivo. 

Substantial levels of protein will be produced leading to an efficient 
stimulation of the immune system. 

In another embodiment of the invention, the described constructs will 
be modified to express mutated Gag proteins that are unable to participate in virus 
particle formation. It is expected that such Gag proteins will stimulate the immune 
system to the same extent as the wild-type Gag protein, but be unable to contribute 
to increased HIV-1 production. This modification should result in safer vectors for 
immunotherapy and immunophrophylaxis. 

EXAMPLE 10 

Inhibition of HTV-1 Expression Using Transdominant 
{TDVTD-Gag-TD Rev or Td Gag-Pro-TD Rev Genes 

Direct injection of DNA or use of vectors other than retroviral 
vectors will allow the constitutive high level of trans-dominant Gag (TDgag) in 
cells. In addition, the approach taken by B.K. Felber et al., Science 239:184-187 
(1988) will allow the generation of retroviral vectors, e.g. mouse-derived retroviral 
vectors, encoding HTV-1 TDgag, which will not interfere with the infection of 
human cells by the retroviral vectors. In the approach of Felber, et al., supra , it was 
shown that fragments of the HIV-1 LTR containing the promoter and part of the 
polyA signal can be incorporated without detrimental effects within mouse 
retroviral vectors and remain transcriptionally silent. The presence of Tat protein 
stimulated transcription from the HIV-1 LTR and resulted in the high level 
expression of genes linked to the HTV-1 LTR, 

The generation of hybrid TDgag-TDRev or TDgag-pro-TDRev genes 
and the introduction of expression vectors in human cells will allow the efficient 
production of two proteins that will inhibit HTV-1 expression. The incorporation of 
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two TD proteins in the same vector is expected to amplify the effects of each one on 
viral replication. The use of the HTV-1 promote in a matter similar to one 
described in B.K. Felber, et aL, supra, will allow high level Gag and Rev expression 
in infected cells. In the absence of infection, expression will be substantially lower. 
Alternatively, the use of other strong promoters will allow the constitutive 
expression of such proteins. This approach could be highly beneficial, because of 
the production of a highly immunogenic gag, which is not able to participate in the 
production of infectious virus, but which, in fact, antagonizes such production. This 
can be used as an efficient immuniprophylactic or immunotherapeutic approach 
against AIDS. 

Examples of trans-dominant mutants are described in Trono et aL, 
Cell 59:112-120 (1989). 

1 . Generation of constructs encoding transdominant Gag mutant 

proteins 

Gag mutant proteins that can act as trans-dominant mutants, as 
described, for example, in Trono et aL, supra, will be generated by modifying vector 
p37Ml-10D or p55Ml-13P0 to produce transdominant Gag proteins at high 
constitutive levels. 

The transdominant Gag protein will stimulate the immune system 
and will inhibit the production of infectious virus, but will not contribute to the 
production of infectious virus. 

The added safety of this approach makes it more acceptable for 
human application. 
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Those skilled in the art will recognize that any gene encoding a 
mRNA containing an inhibitory/instability sequence or sequences can be modified 
in accordance with the exemplified methods of this invention or their functional 
equivalents. 

Modifications of the above described modes for carrying out the 
invention that are obvious to those of skill in the fields of genetic engineering, 
virology, immunology, medicine, and related fields are intended to be within the 
scope of the following claims. 

Every reference cited hereinbefore throughout the application is 
hereby incorporated by reference in its entirety. 
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WHAT IS CLAIMED IS : 

1 . A nucleic acid construct comprising a HTV-1 gag/pol gene 
having the coding sequence of the gag/pol gene set forth in Figure 1 . 

2. A nucleic acid construct comprising a HTV-1 pol gene having 
the coding sequence of the pol gene set forth in Figure 2. 

3. A nucleic acid construct comprising a SIV-1 gag gene having 
the coding sequence of the gag gene set forth in Figure 3. 

4. A nucleic acid construct comprising an HIV or SIV 5' LTR, 
a packaging signal, a gag/pol gene comprising the sequence set forth in Figure 1, a 
.5' splice site, a 3' splice site, an em gene, a tat gene, a functional RNA transport 
element and a 3 * HTV or SIV LTR, said nucleic acid construct being able to produce 
functional Gag, Pol and Env virion components. 

5. A vector comprising the nucleic acid construct of Claim 1, 2, 



3 or 4. 



of Claim 1,2, 3 or 4 



A transformed host cell comprising the nucleic acid construct 



eukaryote. 



prokaryote. 



7. A transformed host cell of Claim 6 wherein said cell is a 

8. The host cell of Claim 7 wherein said cell is a human cell. 

9. A transformed host cell of Claim 6 wherein said cell is a 



10. The host cell of Claim 9 wherein said cell is E. coli . 

11. A pharmaceutical composition comprising the nucleic acid 
construct of Claim 1, 2, 3 or 4 and a pharmaceutically acceptable carrier. 

12. A method for inducing antibodies in a mammal comprising 
administering to a mammal a composition of claim 11, wherein said nucleic acid 
construct is present in an amount which is effective to induce said antibodies in said 
mammal. 

13. A method for inducing cytotoxic T lymphocytes in a mammal 
comprising administering to a mammal a composition of claim 11, wherein said 
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nucleic acid construct is present in an amount which is effective to induce cytotoxic 
T lymphocytes in said mammal. 

14. A vaccine composition for inducing immunity in a mammal 
against HIV infection comprising a pharmaceutical^ acceptable carrier and further 
comprising a therapeutically effective amount of a nucleic acid construct of Claim 1 
capable of producing HTV Gag and Pol proteins in the absence of HIV Rev 
regulatory protein in a cell in vivo . 

15. A vaccine composition for inducing immunity in a mammal 
against HTV infection comprising a pharmaceutical^ acceptable carrier and further 
comprising a therapeutically effective amount of a nucleic acid construct of Claim 2 
capable of producing HIV Pol protein in the absence of HIV Rev regulatory protein 
in a cell in vivo . 

16. A vaccine composition according to claim 14 wherein said 
mammal is a human. 

17. A vaccine composition according to claim 15 wherein said 
mammal is a human. 

18. A method for inducing immunity against HIV infection in a 
mammal which comprises administering to a mammal a therapeutically effective 
amount of a vaccine composition according to claim 14. 

1 9. A method for inducing immunity against HIV infection in a 
mammal which comprises administering to a mammal a therapeutically effective 
amount of a vaccine composition according to claim 15. 

20. A method according to claim 18 wherein said mammal is a 



human. 



human. 



21. A method according to claim 1 9 wherein said mammal is a 



22. A lentiviral expression system comprising the following: 

(a) a packaging vector comprising a HTV-1 gag/pol gene having 
the nucleotide sequence set forth in Figure 1 ; 

(b) a transfer vector; and 

(c) an envelope encoding vector. 
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23 . A transformed host cell comprising the lentiviral expression 
system of Claim 22. 

24. A transformed host cell of Claim 23 wherein said cell is a 

eukaryote. 

25. The host cell of Claim 24 wherein said cell is a human cell. 

26. A process for making a lentiviral particle comprising 
expressing HTV Gag and HIV Pol in a host cell from a vector comprising the 
nucleotide sequences encoding HIV Gag and HTV Pol set forth in Figure 1 in the 
presence of a gene encoding an envelope protein. 

27. A lentiviral expression system which is capable of 
functioning in the absence of Rev, Tat, and any viral RNA transport element 
comprising the following: 

(a) a packaging vector comprising a HTV-1 gag/pol gene which 
has been mutated to eliminate inhibitory/instability regions; 

(b) a transfer vector; and 

(c) an envelope encoding vector. 

28. A transformed host cell comprising the lentiviral expression . 
system of Claim 27. 

29. A transformed host cell of Claim 28 wherein said cell is a 

eukaryote. 

30. The host cell of Claim 29 wherein said cell is a human cell 

31. A process for making a lentiviral particle in the absence of 
Rev, Tat, or any viral RNA transport element comprising expressing HIV Gag and 
HIV Pol in a host cell from a HIV-1 gag/pol gene which has been mutated to 
eliminate inhibitory/instability regions and expressing an Envelope protein from a 
envelope encoding gene whose expression is independent Rev, Tat, or any viral 
RNA transport element. 

32. A nucleic acid construct comprising a SIV-1 env gene having 
the coding sequence of the env gene set forth in Figure 16. 

33. A vector comprising the nucleic acid construct of claim 32. 

34. A transformed host cell comprising the nucleic acid construct 

of claim 32. 
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35. A pharmaceutical composition comprising the nucleic acid 
construct of claim 32 and a pharmaceutical^ acceptable carrier. 
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FIGURE 1 

ATGGGTGCGAGAGCGTCAGTATTAAGCGGGGGAGAATTAGATCGATGGGAAAAAAT 

TCGGTTAAGGCCAGGGGGAAAGAAGAAGTACAAGCTAAAGCACATCGTATGGGCAA 

GCAGGGAGCTAGAACGATTCGCAGTTAATCCTGGCCTGTTAGAAACATCAGAAGGC 

TGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAGACAGGATCAGAGGAGCT 

TCGATCACTATACAACACAGTAGCAACCCTCTATTGTGTGCACCAGCGGATCGAGA 

TCAAGGACACCAAGGAAGCTTTAGACAAGATAGAGGAAGAGCAAAACAAGTCCAAG 

AAGAAGGCCCAGCAGGCAGCAGCTGACACAGGACACAGCAATCAGGTCAGCCAAAA 

TTACCCTATAGTGCAGAACATCCAGGGGCAAATGGTACATCAGGCCATATCACCTA 

GAACTTTAAATGCATGGGTAAAAGTAGTAGAAGAGAAGGCTTTCAGCCCAGAAGTG 

ATACCCATGTTTTCAGCATTATCAGAAGGAGCCACCCCACAGGACCTGAACACGAT 

G T T G AAC AC CGTGGGGG G AC AT C AAGC AG C C AT G C AAAT G T T AAAAG AG AC CAT C A 

ATGAGGAAGCTGCAGAATGGGATAGAGTGCATCCAGTGCATGCAGGGCCTATTGCA 

CCAGGCCAGATGAGAGAACCAAGGGGAAGTGACATAGCAGGAACTACTAGTACCCT . 

T C AG G AAC AAAT AG GAT G GAT G AC AAAT AAT CC AC C TAT C C C AG TAG GAG AG AT C T 

AC AAGAGG TGG ATAATCC TGGGAT TGAACAAGATCG TG AG GATG TAT AGCCC TACC 

AG CAT T C T G GAC AT AAGAC AAGG ACC AAAG G AAC C C T T T AGAG AC TAT G T AGAC C G 

G T T C T AT AAAAC T C T AAG AGC T GAG CAAG C T T C AC AGG AG G T AAAAAAT T GG AT G A 

CAGAAACCTTGTTGGTCCAAAATGCGAACCCAGATTGTAAGACCATCCTGAAGGCT 

CTCGGCCCAGCGGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTAGGAGG 

ACCCGGCCATAAGGCAAGAGTTTTGGCCGAGGCGATGAGCCAGGTGACGAACTCGG 

CGACCATAATGATGCAGAGAGGCAACTTCCGGAACCAGCGGAAGATCGTCAAGTGC 

TTCAATTGTGGCAAAGAAGGGCACACCGCCAGGAACTGCCGGGCCCCCCGGAAGAA 

GGGCTGTTGGAAATGTGGAAAGGAAGGACACCAAATGAAAGATTGTACTGAGAGAC 
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AGGCTAATTTTTTAGGGAAGATCTGGCCTTCCTACAAGGGAAGGCCAGGGAATTTT 
CTTCAGAGCAGACCAGAGCCAACAGCCCCACCAGAAGAGAGCTTCAGGTCTGGGGT 
AGAGACAACAACTCCCCCTCAGAAGCAGGAGCCGATAGACAAGGAACTGTATCCTT 
TAACTTCCCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAGTAAGGATCGGGG 
G G CAAC TC AAGG AAGCGC T G CT C GAT AC AG GAG CAG AT G AT ACAG TAT T AG AAG AA 
ATGAGTTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGGATCGGGGGCTTCAT 
CAAGGTGAGGCAGTACGACCAGATACTCATAGAAATCTGTGGACATAAAGCTATAG 
GTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAAATCTGTTGACC 
CAGATCGGCTGCACCTTGAACTTCCCCATCAGCCCTATTGAGACGGTGCCCGTGAA 
GTTGAAGCCGGGGATGGACGGCCCCAAGGTCAAGCAATGGCCATTGACGAAAGAGA 
AGATCAAGGCCTTAGTCGAAATCTGTACAGAGATGGAGAAGGAAGGGAAGATCAGC 
AAG AT C G G G C C T G AG AAC C C C TAC AAC AC T CCAG T C T TCGCAATCAAG AAG AAG G A 
CAG T ACC AA G T G G AGAAAG CTGG TGG AC T TC AGAG AG C T G AAC AAGAG AAC T CAG G 
ACTTCTGGGAAGTTCAGCTGGGCATCCCACATCCCGCTGGGTTGAAGAAGAAGAAG 
TCAGTGACAGTGCTGGATGTGGGTGATGCCTACTTCTCCGTTCCCTTGGACGAGGA 
C T T CAG G AAG T AC AC T G C C T T C AC GAT AC C TAG CAT CAAC AAC GAG AC AC CAG G C A 
TCCGCTACCAGTACAACGTGCTGCCACAGGGATGGAAGGGATCACCAGCCATCTTT 
CAAAGCAGCATGACCAAGATCCTGGAGCCCTTCCGCAAGCAAAACCCAGACATCGT 
GATCTATCAGTACATGGACGACCTCTACGTAGGAAGTGACCTGGAGATCGGGCAGC 
ACAGGACCAAGATCGAGGAGCTGAGACAGCATCTGTTGAGGTGGGGACTGACCACA 
CCAGACAAGAAGCACCAGAAGGAACCTCCCTTCCTGTGGATGGGCTACGAACTGCA 
TCCTGACAAGTGGACAGTGCAGCCCATCGTGCTGCCTGAGAAGGACAGCTGGACTG 
TGAACGACATACAGAAGCTCGTGGGCAAGTTGAACTGGGCAAGCCAGATCTACCCA 
GGCATCAAAGTTAGGCAGCTGTGCAAGCTGCTTCGAGGAACCAAGGCACTGACAGA 
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AGTGATCCCACTGACAGAGGAAGCAGAGCTAGAACTGGC^GAGAACCGAGAGATCC 
TGAAGGAGCCAGTACATGGAGTGTACTACGACCCAAGCAAGGACCTGATCGCAGAG 
ATCCAGAAGCAGGGGCAAGGCCAATGGACCTACCAAATCTACCAGGAGCCCTTCAA 
GAACCTGAAGACAGGCAAGTACGCAAGGATGAGGGGTGCCCACACCAACGATGTGA 
AGCAGCTGACAGAGGCAGTGCAGAAGATCACCACAGAGAGCATCGTGATCTGGGGC 
AAG AC T CC CAAG T T C AAG C T G C CC AT AC AGAAGGAG ACAT G G G AGACAT G G T G G AC 
CGAGTACTGGCAAGCCACCTGGATCCCTGAGTGGGAGTTCGTGAACACCCCTCCCT 
TGGTGAAACTGTGGTATCAGCTGGAGAAGGAACCCATCGTGGGAGCAGAGACCTTC 
TACGTGGATGGGGCAGCCAACAGGGAGACCAAGCTGGGCAAGGCAGGCTACGTGAC 
CAACCGAGGACGACAGAAAGTGGTGACCCTGACTGACACCACCAACCAGAAGACTG 
AGCTGCAAGCCATCTACCTAGCTCTGCAAGACAGCGGACTGGAAGTGAACATCGTG 
ACAGACTCACAGTACGCACTGGGCATCATCCAAGCACAACCAGACCAATCCGAGTC 
AGAGCTGGTGAACCAGATCATCGAGCAGCTGATCAAGAAGGAGAAAGTGTACCTGG 
CAT G G G T AC C AG C ACAC AAA G G AAT T G GAG G AAAT G AAC AAG TAG AT AAA T TAG T C 
AGTGCTGGGATCCGGAAGGTGCTGTTCCTGGACGGGATCGATAAGGCCCAAGATGA 
AC AT G AGAAG T AC C AC T CCAAC TGGCG CG C TATGGCC AG CG AC TTCAACCTGCC AC 
C TG TAG TAGCAAAAGAAATAG TAGCCAGC TG TGATAAATGTCAGC TAAAAGGAGAA 
GCCATGCATGGACAAGTAGACTGTAGTCCAGGAATATGGCAGCTGGACTGCACGCA 
CCTGGAGGGGAAGGTGATCCTGGTAGCAGTTCATGTAGCCAGTGGATATATAGAAG 
CAGAAGTTATCCCTGCTGAAACTGGGCAGGAAACAGCATATTTTCTTTTAAAATTA 
GCAGGAAGATGGCCAGTAAAAACAATACACACGGACAACGGAAGCAACTTCACTGG 
TGCTACGGTTAAGGCCGCCTGTTGGTGGGCGGGAATCAAGCAGGAATTTGGAATTC 
CCTACAATCCCCAATCGCAAGGAGTCGTGGAGAGCATGAACAAGGAGCTGAAGAAG 
ATCATCGGACAAGTGAGGGATCAGGCTGAGCACCTGAAGACAGCAGTGCAGATGGC 
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AGTGTTCATCCACAACTTCAAAAGAAAAGGGGGGATTGGGGGGTACAGTGCAGGGG 
AAAGGATCGTGGACATCATCGCCACCGACATCCAAACCAAGGAGCTGCAGAAGCAG 
ATCACCAAGATCCAGAACTTCCGGGTGTACTACCGCGACAGCCGCAACCCACTGTG 
GAAGGGACCAGCAAAGCTCCTCTGGAAGGGAGAGGGGGCAGTGGTGATCCAGGACA 
ACAGTGACATCAAAGTGGTGCCAAGGCGCAAGGCCAAGATCATCCGCGACTATGGA 
AAACAGATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGATTAGAACCT 
GGAAGAGCCTGGTGAAGCACCATATG (SEQUENCE ID NO:l) 
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FIGURE 2 



>wildtype 
>mutated 

#i 



TGTACAGAGA 
TGTACAGAGA 



TGGAAAAGGA 
TGGAGAAGGA 



AGGGAAAATT 
AGGGAAGATC 



TCAAAAATTG 
AGCAAGATCG 



* * +** * * 



>wildtype 
>mucated 
#41 



GGCCTGAAAA 
GGCCTGAGAA 



TCCATACAAT 
CCCCTACAAC 



ACTCCAGTAT 
ACTCCAGTCT 



TTGCCATAAA 
TCGCAATCAA 



* * + 



>wildtype 
>mutated 
#81 



GAAAAAAGAC 
GAAGAAGGAC 



AGTACTAAAT 
AGTACCAAGT 



GGAGAAAATT 
GGAGAAAGCT 



AGTAGATTTC 
GGTGGACTTC 



>wildtype 
>mutated 
#121 



AGAGAACTTA 
AGAGAGCTGA 



ATAAGAGAAC 
ACAAGAGAAC 



TCAAGACTTC 
TCAGGACTTC 



TGGGAAGTTC 
TGGGAAGTTC 



>wildtype 
>mutated 
#161 



>wildtype 
>jnutated 
#201 



AATTAGGAAT 
AGCTGGGCAT 



ACCACATCCC 
CCCACATCCC 



GCAGGGTTAA 
GCTGGGTTGA 



AAAAGAAAAA 
AGAAGAAGAA 



ATCAGTAACA 
GTCAGTGACA 



GTACTGGATG 
GTGCTGGATG 



TGGGTGATGC 
TGGGTGATGC 



ATATTTTTCA 
CTACTTCTCC 



* * + 



>wildtype 
>mutated 
#241 



GTTCCCTTAG 
GTTCCCTTGG 



ATGAAGACTT 
ACGAGGACTT 



CAGGAAATAT 
CAGGAAGTAC 



ACTGCATTTA 
ACTGCCTTCA 



>wildtype 
>mutated 
#281 



CCATACCTAG 
CGATACCTAG 



TATAAACAAT 
CATCAACAAC 



GAGACACCAG 
GAGACACCAG 



GGATTAGATA 
GCATCCGCTA 



>wildtype 
>mutated 
#321 



TCAGTACAAT 
CCAGTACAAC 



GTGCTTCCAC 
GTGCTGCCAC 



AGGGATGGAA 
AGGGATGGAA 



AGGATCACCA 
GGGATCACCA 



>wildtype 
>mutated 
#361 



GCAATATTCC 
GCCATCTTTC 



AAAGTAGCAT 
AAAGCAGCAT 



GACAAAAATC 
GACCAAGATC 



TTAGAGCCTT 
CTGGAGCCCT 



>wildtype 
>mutated 
#401 



TTAGAAAACA AAATCCAGAC ATAGTTATCT ATCAATACAT 
TCCGCAAGCA AAACCCAGAC ATCGTGATCT ATCAGTACAT 
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>wildtype GGATGATTTG TATGTAGGAT CTGACTTAGA AATAGGGCAG 

>mutated GGACGACCTC TACGTAGGAA GTGACCTGGA GATCGGGCAG 

#441 

+*++ * +* * * + * 



>wildtype CATAGAACAA AAATAGAGGA GCTGAGACAA CATCTGTTGA 

>mutated CACAGGACCA AGATCGAGGA GCTGAGACAG CATCTGTTGA 

#481 

* + * * * + 



>wildtype 
>mutated 
#521 



GGTGGGGACT TACCACACCA GACAAAAAAC ATCAGAAAGA 
GGTGGGGACT GACCACACCA GACAAGAAGC ACCAGAAGGA 



>wildtype 
>mutated 
#561 



ACCTCCATTC CTTTGGATGG GTTATGAACT CCATCCTGAT 
ACCTCCCTTC CTGTGGATGG GCTACGAACT GCATCCTGAC 



>wildtype AAATGGACAG TACAGCCTAT AGTGCTGCCA GAAAAAGACA 
>mutated AAGTGGACAG TGCAGCCCAT CGTGCTGCCT GAGAAGGACA 
#601 



>wildtype GCTGGACTGT CAATGACATA CAGAAGTTAG TGGGGAAATT 
>mutated GCTGGACTGT GAACGACATA CAGAAGCTCG TGGGCAAGTT 
#641 



>wildtype GAATTGGGCA AGTCAGATTT ACCCAGGGAT TAAAGTAAGG 
>mutated GAACTGGGCA AGCCAGATCT ACCCAGGCAT CAAAGTTAGG 
#681 



>wildtype 
>mutated 
#721 



CAATTATGTA AACTCCTTAG AGGAACCAAA GCACTAACAG 
CAGCTGTGCA AGCTGCTTCG AGGAACCAAG GCACTGACAG 



>wildtype 
>mutated 
#761 



AAGTAATACC ACTAACAGAA GAAGCAGAGC TAGAACTGGC 
AAGTGATCCC ACTGACAGAG GAAGCAGAGC TAGAACTGGC 



>wildtype AGAAAACAGA GAGATTCTAA AAGAACCAGT ACATGGAGTG 

>mutated AGAGAACCGA GAGATCCTGA AGGAGCCAGT ACATGGAGTG 

#801 ' 

* * + + + * 



>wildtype 
>mutated 
#841 



TATTATGACC CATCAAAAGA CTTAATAGCA GAAATACAGA 
TACTACGACC CAAGCAAGGA CCTGATCGCA GAGATCCAGA 
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>wildtype AGCAGGGGCA AGGCCAATGG ACATATCAAA TTTATCAAGA 

>mutated AGCAGGGGCA AGGCCAATGG ACCTACCAAA TCTACCAGGA 

#881 

* * * * * 



>wildtype GCCATTTAAA AATCTGAAAA CAGGAAAATA TGCAAGAATG 

>mutated GCCCTTCAAG AACCTGAAGA CAGGCAAGTA CGCAAGGATG 

#921 

+ + + + * ++ + + 



>wildtype AGGGGTGCCC ACACTAATGA TGTAAAACAA TTAACAGAGG 

>mutated AGGGGTGCCC ACACCAACGA TGTGAAGCAG CTGACAGAGG 

#961 

+ + * * + * * 



>wildtype CAGTGCAAAA AATAACCACA GAAAGCATAG TAATATGGGG 

>mutated CAGTGCAGAA GATCACCACA GAG AG CAT CG TGATCTGGGG 

#1001 

+ * * * + + + 



>wildtype AAAGACTCCT AAATTTAAAC TGCCCATACA AAAGGAAACA 

>mutated CAAGACTCCC AAGTTCAAGC TGCCCATACA GAAGGAGACA 

#1041 

+ + + + + + * 



>wildtype TGGGAAACAT GGTGGACAGA GTATTGGCAA GCCACCTGGA 
>mutated TGGGAGACAT GGTGGACCGA GTACTGGCAA GCCACCTGGA 
#1081 



>wildtype TTCCTGAGTG GGAGTTTGTT AATACCCCTC CTTTAGTGAA 

>mutated TCCCTGAGTG GGAGTTCGTG AACACCCCTC CCTTGGTGAA 

#1121 

+ + + *■ + + 



>wildtype 
>mutated 
#1161 



ATTATGGTAC 
ACTGTGGTAT ■ 



CAGTTAGAGA 
CAGCTGGAGA 



AAGAACCCAT 
AGGAACCCAT 



AGTAGGAGCA 
CGTGGGAGCA 



>wildtype 
>mutated 
#1201 



GAAACCTTCT 
GAGACCTTCT 



ATGTAGATGG 
ACGTGGATGG 



GGCAGCTAAC 
GGCAGCCAAC 



AGGGAGACTA 
AGGGAGACCA 



>wildtype 
>mutated 
#1241 



AATTAGGAAA 
AGCTGGGCAA 



AGCAGGATAT 
GGCAGGCTAC 



GTTACTAATA 
GTGACCAACC 



GAGGAAGACA 
GAGGACGACA 



>wildtype 
>mutated 
#1281 



AAAAGTTGTC 
GAAAGTGGTG 



ACCCTAACTG 
ACCCTGACTG 



ACACAACAAA 
ACACCACCAA 



TCAGAAGACT 
CCAGAAGACT 
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>wildtype 
>mutated 
#1321 



GAGTTACAAG 
GAGCTGCAAG 
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CAATTTATCT 
CCATCTACCT 



AGCTTTGCAG 
AGCTCTGCAA 



GATTCGGGAT 
GACAGCGGAC 



**** * 



>wilcitype 
>mutated 
#1361 



TAGAAGTAAA 
TGGAAGTGAA 



CATAGTAACA 

CATCGTGACA 



GACTCACAAT 
GACTCACAGT 



ATGCATTAGG 
ACGCACTGGG 



>wildtype 
>mutated 
81401 



>wildtype 
>mutated 
1*1441 



AATCATTCAA 
CATCATCCAA 



GCACAACCAG 
GCACAACCAG 



ATCAAAGTGA 
ACCAATCCGA 



ATCAGAGTTA 
GTCAGAGCTG 



GTCAATCAAA 
GTGAACCAGA 



TAATAGAGCA 
TCATCGAGCA 



* + + + 

GTTAATAAAA 
GCTGATCAAG 



AAGGAAAAGG 
AAGGAGAAAG 



★ * * + 



>wildtype 
>mutated 
#1481 



TCTATCTGGC 
TGTACCTGGC 



ATGGGTACCA 
ATGGGTACCA 



GCACACAAAG 
GCACACAAAG 



GAATTGGAGG 
GAATTGGAGG 



>wildtype 
>mutated 
#1521 



AAATGAACAA GTAGATAAAT TAGTCAGTGC TGGAATCAGG 
AAATGAACAA GTAGATAAAT TAGTCAGTGC TGGGATCCGG 



>wildtype 
>mutated 
#1561 



AAAGTACTAT TTTTAGATGG AATAGATAAG GCCCAAGATG 
AAGGTGCTGT TCCTGGACGG GATCGATAAG GCCCAAGATG 



>wildtype 
>mutated 
#1601 



AACATGAGAA ATATCACAGT AATTGGAGAG CAATGGCTAG 
AACATGAGAA GTACCACTCC AACTGGCGCG CTATGGCCAG 



>wildtype 
>mutated 
#1641 



TGATTTTAAC CTGCCACCTG TAGTAGCAAA AGAAATAGTA 
CGACTTCAAC CTGCCACCTG TAGTAGCAAA AGAAATAGTA 



>wildtype 
>mutated 
#1681 



GCCAGCTGTG ATAAATGTCA GCTAAAAGGA GAAGCCATGC 
GCCAGCTGTG ATAAATGTCA GCTAAAAGGA GAAGCCATGC 



>wildtype ATGGACAAGT AGACTGTAGT CCAGGAATAT GGCAACTAGA 
>mutated ATGGACAAGT AGACTGTAGT CCAGGAATAT GGCAGCTGGA 
#1721 
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>wildtype 
>mutated 
#1761 



9/43 



TTGTACACAT TTAGAAGGAA AAGTTATCCT 
CTGCACGCAC CTGGAGGGGA AGGTGATCCT 



GGTAGCAGTT 
GGTAGCAGTT 



******** * * 



>wildtype 
>mutated 
#1801 



CATGTAGCCA GTGGATATAT AGAAGCAGAA GTTATTCCAG 
CATGTAGCCA GTGGATATAT AGAAGCAGAA GTTATCCCTG 



>wildtype 
>mutated 
#1841 



CAGAAACAGG GCAGGAAACA GCATATTTTC TTTTAAAATT 
CTGAAACTGG GCAGGAAACA GCATATTTTC TTTTAAAATT 



> wild type 
>mutated 
#1881 



AGCAGGAAGA TGGCCAGTAA AAACAATACA TACAGACAAT 
AGCAGGAAGA TGGCCAGTAA AAACAATACA CACGGACAAC 



>wildtype 
>mutated 
#1921 



GGCAGCAATT TCACCAGTGC TACGGTTAAG GCCGCCTGTT 
GGAAGCAACT TCACTGGTGC TACGGTTAAG GCCGCCTGTT 



>wildtype 
>mutated 
#1961 



GGTGGGCGGG AATCAAGCAG GAATTTGGAA TTCCCTACAA 
GGTGGGCGGG AATCAAGCAG GAATTTGGAA TTCCCTACAA 



>wildtype 
>mutated 
#2001 



TCCCCAAAGT CAAGGAGTAG TAGAATCTAT 
TCCCCAATCG CAAGGAGTCG TGGAGAGCAT 



GAATAAAGAA 
GAACAAGGAG 



>wildtype 
>mutated 
#2041 



TTAAAGAAAA 
CTGAAGAAGA 



TTATAGGACA 
TCATCGGACA 



GGTAAGAGAT 
AGTGAGGGAT 



CAGGCTGAAC 
CAGGCTGAGC 



>wildtype 
>mutated 
#2081 



ATCTTAAGAC 
ACCTGAAGAC 



AGCAGTACAA 
AGCAGTGCAG 



ATGGCAGTAT 
ATGGCAGTGT 



TCATCCACAA 
TCATCCACAA 



>wildtype 
>mutated 
#2121 



TTTTAAAAGA 
CTTCAAAAGA 



AAAGGGGGGA 
AAAGGGGGGA 



TTGGGGGGTA 
TTGGGGGGTA 



CAGTGCAGGG 
CAGTGCAGGG 



>wildtype GAAAGAATAG TAGACATAAT AGCAACAGAC ATACAAACTA 

>mutated GAAAGGATCG TGGACATCAT CGCCACCGAC ATCCAAACCA 

#2161 

* + * * * * * * * 
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>wiidtype 
>mutated 
#2201 



>wildtype 
>mutated 
#2241 



>wildtype 
>mutated 
#2281 



>wildtype 
>mutated 
#2321 



>wildtype 
>mutated 
#2361 



>wildtype 
>mutated 
82401 



>wildtype 
>mutated 
#2441 



10/43 

AAGAATTACA AAAACAAATT ACAAAAATTC AAAATTTTCG 
AGGAGCTGCA GAAGCAGATC ACCAAGATCC AGAACTTCCG 

+ *+ + + *** + ** ** + 

GGTTTATTAC AGGGACAGCA GAAATCCACT TTGGAAAGGA 
GGTGTACTAC CGCGACAGCC GCAACCCACT GTGGAAGGGA 

* * ++ +*+ - * * 

CCAGCAAAGC TCCTCTGGAA AGGTGAAGGG GCAGTAGTAA 
CCAGCAAAGC TCCTCTGGAA GGGAGAGGGG GCAGTGGTGA 



+ * + * * 

TACAAGATAA TAGTGACATA AAAGTAGTGC CAAGAAGAAA 
TCCAGGACAA CAGTGACATC AAAGTGGTGC CAAGGCGCAA 



+ + ++ * + *+ + 

AGCAAAGATC ATTAGGGATT ATGGAAAACA GATGGCAGGT 
GGCCAAGATC ATCCGCGACT ATGGAAAACA GATGGCAGGT 



GATGATTGTG TGGCAAGTAG ACAGGATGAG GATTAGAACA 
GATGATTGTG TGGCAAGTAG ACAGGATGAG GATTAGAACC 



TGGAAAAGTT TAGTAAAACA CCATATG 
TGGAAGAGCC TGGTGAAGCA CCATATG 
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FIGURE 3 

ATGGGCGTGAGAAACTCCGTCTTGTCAGGGAAGAAAGCAGATGAATTAG 

AAAAAATTAGGCTACGACCCAACGGAAAGAAAAAGTACATGTTGAAGC 

ATGTAGTATGGGCAGCAAATGAATTAGATAGATTTGGATTAGCAGAAAG 

CCTGTTGGAGAACAAAGAAGGATGTCAAAAAATACTTTCGGTCTTAGCT 

CCATTAGTGCCAACAGGCTCAGAAAATTTAAAAAGCCTTTATAATACTG 

TCTGCGTCATCTGGTGCATTCACGCAGAAGAGAAAGTGAAACACACTGA 

GGAAGCAAAACAGATAGTGCAGAGACACCTAGTGGTGGAAACAGGAAC 

CACCGAAACCATGCCGAAGACCTCTCGACCAACAGCACCATCTAGCGGC 

AGAGGAGGAAACTACCCAGTACAGCAGATCGGTGGCAACTACGTCCAC 

CTGCCACTGTCCCCGAGAACCCTGAACGCTTGGGTCAAGCTGATCGAGG 

AGAAGAAGTTCGGAGCAGAAGTAGTGCCAGGATTCCAGGCACTGTCAG 

AAGGTTGCACCCCCTACGACATCAACCAGATGCTGAACTGCGTTGGAGA 

CCATCAGGCGGCTATGCAGATCATCCGTGACATCATCAACGAGGAGGCT 

GCAGATTGGGACTTGCAGCACCCACAACCAGCTCCACAACAAGGACAA 

CTTAGGGAGCCGTCAGGATCAGACATCGCAGGAACCACCTCCTCAGTTG 

ACGAACAGATCCAGTGGATGTACCGTCAGCAGAACCCGATCCCAGTAGG 

CAACATCTACCGTCGATGGATCCAGCTGGGTCTGCAGAAATGCGTCCGT 

ATGTACAACCCGACCAACATTCTAGATGTAAAACAAGGGCCAAAAGAG 

CCATTTCAGAGCTATGTAGACAGGTTCTACAAAAGTTTAAGAGCAGAAC 

AGACAGATGCAGCAGTAAAGAATTGGATGACTCAAACACTGCTGATTCA 

AAATGCTAACCCAGATTGCAAGCTAGTGCTGAAGGGGCTGGGTGTGAAT 

CCCACCCTAGAAGAAATGCTGACGGCTTGTCAAGGAGTAGGGGGGCCG 

GGACAGAAGGCTAGATTAATGGCAGAAGCCCTGAAAGAGGCCCTCGCA 

CCAGTGCCAATCCCTTTTGCAGCAGCCCAACAGAGGGGACCAAGAAAGC 

CAATTAAGTGTTGGAATTGTGGGAAAGAGGGACACTCTGCAAGGCAATG 

CAGAGCCCCAAGAAGACAGGGATGCTGGAAATGTGGAAAAATGGACCA 

TGTTATGGCCAAATGCCCAGACAGACAGGCGGGTTTTTTAGGCCTTGGT 

CCATGGGGAAAGAAGCCCCGCAATTTCCCCATGGCTCAAGTGCATCAGG 

GGCTGATGCCAACTGCTCCCCCAGAGGACCCAGCTGTGGATCTGCTAAA 

GAACTACATGCAGTTGGGCAAGCAGCAGAGAGAAAAGCAGAGAGAAAG 

CAGAGAGAAGCCTTACAAGGAGGTGACAGAGGATTTGCTGCACCTCAAT 

TCTCTCTTTGGAGGAGACCAGTAG 
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FIGURE 4 

SIV gag . 

#1 

ATGGGCGTGAGAAACTCCGTCTTGTCAGGGAAGAAAGCAG 

SIV gag 

#41 

ATGAATTAGAAAAAATTAGGCTACGACCCAACGGAAAGAA 

SIV gag 

#81 

AAAGTACATGTTGAAGCATGTAGTATGGGCAGCAAATGAA 

SIV gag 

#121 

TTAGATAGATTTGGATTAGCAGAAAGCCTGTTGGAGAACA 

SIV gag 

#161 

AAGAAGGATGTCAAAAAATACTTTCGGTCTTAGCTCCATT 

SIV gag 

#201 . ' 

AGTGCCAACAGGCTCAGAAAATTTAAAAAGCCTTTATAAT 

SIV gag 

#241 

ACTGTCTGCGTCATCTGGTGCATTCACGCAGAAGAGAAAG 

SIV gag 

SIVgagDX . . 

#281 

T GAAAC AC AC T GAG G AAG CAAAAC AGAT AG T GC AG AG AC A 

SIV gag — A — A • — T A — A 

SIVgagDX. . C--C C G — G 

#321 

CCTAGTGGTGGAAACAGGAACMACMGAAACYATGCCRAAR 

SIV gag — AAG-A ' 

SIVgagDX . . --CTC-C 

#361 

ACMWSTMGACCAACAGCACCATCTAGCGGCAGAGGAGGAA 
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SIV gag -T A — A — A T 

SIVgagDX. .-C G--G--C C 

#401 

AYTACCCAGTACARCARATMGGTGGTAACTAYGTCCACCT 

SIV gag T-AAG AT -A — T — C A — AT — 

SIVgagDX . . C-GTC CC-G — C — T C — GC — 

#441 

GCCAYTRWSCCCGAGAACMYTRAAYGCYTGGGTMAARYTG 

SIV gag —A A A--T 

SIVgagDX. C G G— C 

#481 

ATMGAGGARAAGAARTTYGGAGCAGAAGTAGTGCCAGGAT 

SIV gag -T T f— 

SIVgagDX. . -C C C— 

#521 

T Y CAG G CAC T G T C AGAAG G T T G CACCC C C TAYG ACAT YAA 

SIV gag T T-A--T--T--G A 

SIVgagDX . .C C-G— C— C--T G 

#561 

YCAGATGYTRAAYTGYGTKGGAGACCATCARGCGGCTATG 

SIV gag T A-A--T--T--A — 

SIVgagDX. . C C-T--C--C-- C 

#601 

CAG AT Y AT CMG W G A Y AT Y ATMAAC GAG GAG G C T G CAG AT T 

SIV gag 

SIVgagDX. . 

#641 

GGGACTTGCAGCACCCACAACCAGCTCCACAACAAGGACA 

SIV gag T — T A — T 

SIVgagDX. . C— C C--C 

#681 

ACTTAGGGAGCCGTCAGGATCAGAYATYGCAGGAACMACY 

SIV gag AGT A--T A A- A — A- 

SIVgagDX. .TCC T--C G C-T — G- 

#721 

WSYTCAGTWGAYGAACARATCCAGTGGATGTACMGWCARC 
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SIV gag C — A T — -A-GA 

SIVgagDX. . G--C C— C-TC 

#761 

AGAACCCSATMCCAGTAGGCAACATYTACMGKMGATGGAT 

SIV gag A GT A--A--T— CA-A T A 

SIVgagDX . . G TC G--G--C— TOT C G 

#801 

CCARCTGGGKYTGCARAARTGYGTYMGWATGTAYAACCCR 

SIV gag ~A 

SIVgagDX. C 

#841 

ACMAACATTCTAGATGTAAAACAAGGGCCAAAAGAGCCAT 

SIV gag 

#881 

T T C AG AG C T AT G TAG AC AG G T T C T ACAAAAG T T TAAGAG C 

SIV gag 

#921 

AGAACAGACAGATGCAGCAGTAAAGAATTGGATGACTCAA 

SIV gag 

#961 

AC AC TGC TG AT TC AAAAT GC TAACCC AG AT TGC AAG CTAG 

SIV gag , . 

#1001 

TGCTGAAGGGGCTGGGTGTGAATCCCACCCTAGAAGAAAT 

SIV gag 

#1041 

GCTGACGGCTTGTCAAGGAGTAGGGGGGCCGGGACAGAAG 

SIV gag 

#1081 

GCTAGATTAATGGCAGAAGCCCTGAAAGAGGCCCTCGCAC 

SIV gag 

#1121 

CAGTGCCAATCCCTTTTGCAGCAGCCCAACAGAGGGGACC 

SIV gag 

#1161 

AAGAAAGCCAATTAAGTGTTGGAATTGTGGGAAAGAGGGA 
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SIV gag 

#1201 

CACTCTGCAAGGCAATGCAGAGCCCCAAGAAGACAGGGAT 



SIV gag 

#1241 

GCTGGAAATGTGGAAAAATGGACCATGTTATGGCCAAATG 



SIV gag 

#1281 

CCCAGACAGACAGGCGGGTTTTTTAGGCCTTGGTCCATGG 

SIV gag 

#1321 

GGAAAGAAGCCCCGCAATTTCCCCATGGCTCAAGTGCATC 

SIV gag 

#1361 

AGGGGCTGATGCCAACTGCTCCCCCAGAGGACCCAGCTGT 

SIV gag 

#1401 

GGATCTGCTAAAGAACTACATGCAGTTGGGCAAGCAGCAG 

SIV gag 

#1441 

- AGAGAAAAGCAGAG AGAAAGCAGAGAGAAGCC T TACAAGG 

SIV gag 

#1481 

AGGTGACAGAGGATTTGCTGCACCTCAATTCTCTCTTTGG 

SIV gag 

#1521 

AGGAGACCAGTAG 
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FIGURES 
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FIGURE 6 
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FIGURE 7 




BssHII (711) 
Clal (830) 
Accl (959) 



Clal (3259) 

Xhol (3548) 
Apal (3557) 
Kpn! (3563) 
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FIGURE 8 



BssHII (711) 




Kpnl (3563) 
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FIGURE 9 



1 


CCTGGCCATT GCATACGTTG 


TATCCATATC 


ATAATATGTA CATTTATA1T GGCTCATGTC 


CAACATTACC 


71 


GCCATGTTGA 


CATTGA7TAT 


TGACTAGTTA 


TTAATAGTAA 


TCAATTACGG GGTCATTAG7 


TCATAGCCCA 


141 


TATATGGAGT TCCGCGTTAC 


ATAACTTACG 


GTAAATGGCC 


CGCCTGGCTG ACCGCCCAAC 


GACCCCCGCC 


211 


CATTGACGTC 


AATAATGACG 


TATGTTCCCA 


TAGTAACGCC 


AA7AGGGAC7 TTCCATTGAC 


GTCAA7GGGT 


2B1 


GGAGTATTTA 


CGGTAAACTG 


CCCACTTGGC 


AGTACATCAA GTCTATCATA TGCCAAGTAC 


GcccccTA'rr 


351 


GACGTCAATG 


ACGGTAAATG 


OCCCGCCTGG 


CATTATGCCC 


AGTACATGAC CTTATGGGAC 


TTTXlVxACTT 


421 


GGCAGTACAT CTACGTATTA 


GTCATCGCTA 


TTACCATGGT GATGCGGTTT TGGCAGTACA 


TCAATGGGCG 


491 


TGGATAGCGG 


TTTGACTCAC 


GGGGATTTCC 


AAGTCTCCAC 


CCCATTGACG TCAATGGGAG 


TTTGTnTGG 


561 


CACCAAAATC 


AACGGGACTT 


TCCAAAATGT 


CGTAACAACT CCGCCCCATT GACGCAAATG 




631 


GTGTACGGTG 


GGAGGTCTA? 


ATAAGCAGAG 


C7CGTTTAGT 


GAACCGTCAG ATCGCCTGGA 


GACGCCATCC 


701 


ACGCTGTTTT 


GACCTCCATA 


GAAGACA CCG 


GGACCGATCC 


Sail (75f 

AGCCTCCGCG GGCGCGCGTC GACAGAGAGA 


771 


TGGGTGCGAG 


AGCGTCAGTA 


TTAAGCGGGG 


GAGAATTAGA 


TCGATGGGAA AAAATTCGGT 


TAAGGCCAGG 


841 


GGGAAAGAAG 


AAG T AC AAGC 


TAAAGCACAT 


CGTATGGGCA 


AGCAGGGAGC TAGAACGATT 


CGCAGTTAAT 


911 


CCTGGCCTGT 


TAGAAACATC 




AGACAAATAC 


TGGGACAGCT ACAACCATCC 


CTTCAGACAG 


?31 


GATCAGAGGA 


GCTTCGATCA 


CTATACAACA 


CAGTAGCAAC 


CCTCTATTGT GTGCACCAGC 


GGATCGAGAT 


1051 


CAAGGACACC 


AAGGAAGCTT 


TAGACAAGAT 


A.GAGGAAGAG 


CAAAACAAGT CCAAGAAGAA 


GGCCCAGCAG 


1121 


GCAGCAGCTG 


ACACAGGACA 


CA\j \_AATCAG 


GTCAGCCAAA 


ATTACCCTAT AGTGCAGAAC 


ATCCAGGGGC 


1191 


AAATGGTACA 


TCAGGCCATA 


TCwvCCTAGAA 


CTT7AAATGC 


ATGGGTAAAA GTAGTAGAAG 


AGAAGGCTTT 


1261 


CAGCCCAGAA 


GTGATACCCA 


tgttttcagc 


ATTATCAGAA 


GGAGCCACCC CACAGGACCT 


GAACACGATG 


1331 


TTGAACACCG 


TGGGGGGACA 


TCAAQCAGCC 


ATGGAAATGT 


TAAAAGAGAC CATCAATGAG 


GAAGCTGCAG 


1401 


AATGGGATAG 


AGTGCATCCA 


GTGCATGCAG 


GGCCTATTGC 


ACCAGGCCAG ATGAGAGAAC 


CAAGGGGAAG 


1471 


TGACATAGCA 


GGAACTACTA 


GTACCCTTCA 


GGAA.CAA.VTA 


GGATGGATGA CAAATAATCC 


ACCTATCCCA 


1541 


GTAGGAGAGA 


TCTACAAGAC 


GTGGATAATC 


CTGGGATTGA 


ACAAGATCGT GAGGATGTAT 


AGCCCTACCA 


1611 


GCATTCTGGA CATAAGACAA 


GGACCAAAGG 


AACCCTTTAG 


AGACTATGTA GACCGGTTCT 


ATAAAACTCT 


1681 


AAGAGCTGAG 


CAAGCTTCAC 


AGGAGGTAAA 


AAATTGGATG 


ACAGAAACCT TGTTGGTCCA 


AAATGCGAAC 


1751 


CCAGATTGTA 


AGACCATCCT 


GAAGGCTCTC 


GGCCCAGCGG 


CTACACTAGA AGAAATGATG 


ACAGCATGTC 


1821 


AGGGAGTAGG 


AGGACCCGGC 


CATAAGGCAA 


GAG7TTTGGC 


CGAGGCGATG AGCCAGGTGA 


CGAACTCGGC 



1891 GACCA7AATG ATGCAGAGAG G2AACTTCCG GAACCAGCGG AAGATCGTCA AGTGCTTCAA TTGTGGCAAA 
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1961 GAAGGGCACA CCGCCAGGAA CTGCCGGGCC CCCCGGAAGA AGGGCTGTTG GAAATGTGGA AAGGAAGGAC 
2031 ACCAAATGAA AGATTGTACT GAGAGACAGG CTAATTTTTT AGGGAAGATC TGGCCTTCCT ACAAGGGAAG 
2101 GCCAGGGAAT TTTCTTCAGA GCAGAOCAGA GCCAACAGCC CCACCAGAAG AGAGCTTCAG GTCTGGGGTA 
2171 GAGACAACAA CTCCCCCTCA GAAGCAGGAG CCGATAGACA AGGAACTGTA TCCTTTAACT TCCCTCAGAT 
2241 CACTCTTTGG CAACGACCCC TCGTCACAGT AAGGATCGGG GGGCAACTCA AGGAAGCGCT GCTCGATACA 
2311 GGAGCAGATG ATACAGTATT AGAAGAAATG A GTTTCCCAG GAAGA7GGAA ACCAAAAATG ATAGGGGGGA 
2381 TCGGGGGCTT CATCAAGGTG AGGCAGT ACG ACCAGATACT CATAGAAATC TGTGGACATA AAGCTATAGG 
2451 TACAGTATTA GTAGGACCTA CACCTOTCAA CATAATTGGA AGAAATCTGT TGACCCAGAT CGGCTGCACC 
2521 TTGAACTTCC CCATCAGCCC TATTGAGACG GTGCCCGTGA AGTTGAAGCC GGGGATGGAC GGCCCCAAGG 
2591 TCAAGCAATC GCCATTGACG AAAGAGAAG A TCAAGGCCTT AGTCGAAATC TGTACAGAGA TGGAGAAGGA 
2661 AGGGAAGATC AGCAAGATCG GGCCTGAGAA CCCCTACAAC ACTCCAGTCT TCGCAATCAA GAAGAAGGAC 
2731 AGTACCAAGT GGAGAAAGCT GGTGGACTTC AGAGAGCTGA ACAAGAGAAC TCAGGACTTC TGGGAAGTTC 
2801 AGCTGGGCAT CCCACATCCC GCTGGGTTGA AGAAGAAGAA GTCAGTGACA GTGCTGGATG TGGGTGATGC 
2371 CTACTTCTCC GTTCCCTTGG ACGAGGACTT CAGGAAGTAC ACTGCCTTCA CGATACCTAG CATCAACAAC 
2941 GAGACACCAG GCATCCGCTA CCAGT ACAAC GTGCTGCCAC AGGGATGGAA GGGATCACCA GCCATCTTTC 
3011 AAAGCAGCAT GACCAAGATC CTGGAGCC CT TCCGCAAGCA AAACCCAGAC ATCGTGATCT ATCAGTACAT 
3081 GG.^CGACCTC TACGTAGGAA GTGACCTGGA GATCGG GCAG CACAGGACCA AGATCGAGGA GCTGAGACAG 
3151 CATCTGTTGA GGTGGGGACT GACCACACCA GACAAGAAGC ACCAGAAGGA ACCTCCCTTC CTGTGGATGG 
3221 GCTACGAACT GCATCCTGAC AAGTGGACAG TGCAGCCCAT CGTGCTGCCT GAGAAGGACA GCTGGACTGT 
32 91 GAACGACATA CAGAAGCTCG TGGGCAAG TT GAACTGGGCA AGCCAGATCT ACCCAGGCAT CAAAGTTAGG 
3351 CAGC7GTGCA AGCTGCTTCG AGGAACCAAG GCACTG ACAG AAGTGATCCC ACTGACAGAG GAAGCAGAGC 
3431 TAGAACTGGC AGAGAACCGA GAGATCCTGA AGGAGCCAGT ACATGGAGTG TACTACGACC CAAGCAAGGA 
3501 C^TCATCGCA GAGATCCAGA AGCAGGGGCA AGGCCAATCG ACCTACCAAA TCTACCAGGA GCCCTTCAAG 
3571 AACCTGAAGA CAGGCAAGTA CGCAAGGATG AGGGGTGCCC ACA CCAACGA TGTGAAGCAG CTGACAGAGG . 
3641 C^QTGCAGAA GATCACCACA GAGAGCATCG T GATCTGGGG CAAGACTCCC AAGTTCAAGC TGCCCATACA 
3711 GAAGGAGACA TGGGAGACAT GGTGGACCGA GTACT GGCAA GCCACCTGGA TCCCTGAGTG GGAGTTCGTG 
3781 AACACCCCTC CCTTGGTGAA ACTGTGGTAT CAGCTG GAGA AGGAACCCAT CGTGGGAGCA GAGACCTTCT 
3851 ACGTGG -' tTGG GGCAGCCAAC AGGGAGACCA AGCTGG GCAA GGCAGGCTAC GTGACCAACC GAGGACGACA 
3921 G-^CTSGTG ACCCTGACIG ACACCACCAA CCAGAAGACT GAGCTGCAAG CCATCTACCT AGCTCTGCAA 
3991 GACAGCGGAC TGGAAGTGAA CATCGTGACA GACTCACAGT ACGCACTGGG CATCATCCAA GCACAACCAG 
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4061 ACCAATCCGA GTCAGAGCTG GTCAACCAGA TCATCGAGCA GCTGATCAAG AAGGAGAAAG TGTACCTGGC 



4131 


ATGGGTACCA GCACACAAAG 


GAATTGGAGG AAATGAACAA GTAGATAAAT TAGTCAGTGC 


TGGGATCCGG 


4201 


AAGGTGC7GT TCCTGGACGG 


GATCGATAAG GCCCAAGATG AACATGAGAA GTACCACTCC AACTGGCGCG 


4271 


CTATGGCCAG CGACTTCAAC 


CTGCCACCTG TAGTAGCAAA AGAAATAGTA GCCAGCTGTG 


ATAAATGTCA 


4341 


GCTAAAAGGA GAAGCCATGC 


ATGGACAAGT AGACTGTAGT CCAGGAATAT GGCAGCTGGA 




4411 


CTGGAGGGGA AGGTGATCCT GGTAGCAGTT CATGTAGCCA GTGGATATAT AGAAGCAGAA 


CTTATCCCTG 


4481 


CTGAAACTGG GCAGGAAACA 


GCATATTTTC TTTTAAAATT AGCAGGAAGA TGGCCAGTAA 


AAACAATACA 


4551 


CACGGACAAC GGAAGCAACT 


TCACIGGTGC TACGGTTAAG GCCGCCTGTT GGTGGGCGGG 


AATCAAGCAG 


4621 


GAATTTC3GAA TTCCCTACAA 


TCCCCAATCG CAAGGAGTCG TGGAGAGCAT GAACAAGGAG 


CTGAAGAAGA 


4691 


TCATCGGACA AGTGAGGGAT 


CAGGCTGAGC AOCTGAAGAC AGCAGTGCAG ATGGCAGTGT 


TCATCCACAA 


4761 


CTTCAAAAGA AAAGGGGGGA 


TTGGGGGGTA CAGTGCAGGG GAAAGGATCG TGGACATCAT 


CGCCACCGAC 


4831 


ATCCAAACCA AGGAGCTGCA 


GAAGCAGATC ACCAAGATCC AGAACTTCCG GGTGTACTAC 


CGCGACAGCC 


4901 


GCAACCCACT GTGGAAGGGA 


CCAGCAAAGC TCCTCTGGAA GGGAGAGGGG GCAGTGGTGA 


TCCAGGACM 


4971 


CAGTGACATC AAAGTGGTGC 


CAAGGCGCAA GGCCAAGATC ATCCGCGACT ATGGAAAACA 


GATGGCAGGT 


5041 


GATGATTGTG TGGCAAGTAG 


ACAGGATGAG GATTAGAACC TGGAAGAGCC TGGTGAAGCA 


CCATATGGCG 




Nhel (5117) 





BstBI (5111) 

5111 TTCGAAGCTA GCCTCGAGAT CCAGATCTGr AGTTGCCAGL* CATCTGTTGT TTGCCCCTCC 

5181 CCCGTGCC7T CCTTGACCCT GGAAGG7GCC AC7CCCACTG TCCTTTCCTA ATAAAATCAG GAAATTGCAT 



5251 


CGCATTGTCT 


GAGTAGG7GT 


CATTCTATTC 


T^o _■ GG"-> 3Tu^ 


GGTKj GGGCAG 


CACAGCAAGG 


GGGAGGATTG 


5321 


GGL-lAGACAAT" 


AGC mGGCATG 


C . oGGGA"P3r 


GG . ov_r3'CTCT 


ATGGCTAC " 


AGGTGCTGAA 


GAATTGACCC 


5391 


GGTTCCTCCT 


G\iG C**IA»j.*w% 


GAAGCAGGCA 


v.r..v.CwC I IC 


TCTCTGACAC 


ACCCTGTCCA 


CGCCCCTGGT 


5461 


TCT7AGTTCC 


agccccactc 




CA7AGCTCAG 


GAGGGC7CCC 


CCTTCAATCC 


CACCCGCTAA 


5531 


AGTACT7CGA 


GCC-GTCTCTC* 


CCTCCCTCAT 


C AC C CCACCA 




AGCCTCCAAG 


AGTGGGAAGA 


5601 


AATTAAAGCA 


/vGnTAvJGC T>» 


T7AAGTGCAG 


AGG 3AGAGAA 


AATCCCTCCA 


ACATGTGAGG 


AAGTAATGAG 



5671 AGAAATCA7A GAATTICTIC CGCTTCCTCG CTCACIGACT CGCTGCGCTC GGTCGTICGG CTGCGGCGAG 



5741 CGGTATCAGC TCACTCAAAG GCGGTAATAC GGTTATCCAC AGAATCAGGG GATAACGCAG GAAAGAACAT 
5811 GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA CCGTAAAAAG GCCGOGTTGC TGGCGTTTTT CCATAGGCIC 
5881 CGCCCCCCTG ACGAGCATCA CAAAAATCGA CGCTCAAGTC AGAGCTGGCG AAACCCGACA GGACTATAAA 
5951 GATACCAGGC GTTTCCCCCT GGAAGCTCCC TOGTGCGCTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA 
6021 CCTGTCCGCC TTTCTCCCTT CGGGAAGCGT GGCGCTTTCT CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 
6091 GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTCCACGAAC CCCCCGTTCA GCCCGACCGC TGCGCCTTAT ■ 
6161 CCGGTAACTA TCGTCTTGAG TCCAACCCGG TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA 
6231 CAGGATTAGC AGAGCGAGGT ATGTAGGCGG TGCTACAGAG TTCTTGAAGT GGTGGCCTAA CTACGGCTAC 
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6301 ACTAGAAGGA CAGTATTIGG TATCTGCGCT CIGCTGAAGC CAGTTACCTT CGGAAAAAGA GTTGGTAGCT 
6371 CTTCATCCGG CAAACAAACC ACCGCTGGTA GCGG7GGTIT TTTTCTTTGC AAGCAGCAGA TTACGCGCAG 
6441 AAAAAAAGGA TCTCAAGAAG ATCCTTIGAT CTTTTCTACG GGGTCTGACG CTCAGTGGAA CGAAAACTCA 
6511 CGTTAAGGGA TTTTGGTCAT GAGATTATCA AAAAGGATCT TCACCTAGAT CCTTTTAAAT TAAAAATGAA 
6581 GTCTTAAATC AATCTAAAGT ATATATGAGT AAACTTCGTC TGACAGTTAC CAATGCTTAA TCAGTGAGGC 
6651 ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT GCCTGACTCC GGGGGGGGGG GGCGCTGAGG 
6721 TCTGCCTCGT GAAGAAGGTG TTGCTGACTC ATACCAGGCC TGAATCGCCC CATCATCCAG CCAGAAAGTG 
6791 AGGGAGCCAC GGTTGATGAG AGCTTTCnTG TAGGTGGACC AGTTGGTGAT nTGAACTTT TGCTITCCCA 
6861 CGGAACGGTC TGCGTTGTCG GGAAGATGCG TGATCTGATC CTTCAACTCA GCAAAAGTTC GATTTATTCA 
6931 ACAAAGCCGC CGTCCCGTCA AGTCAGOGTA ATCCTCTGCC AGTGTTACAA CCAATTAACC AATTCIGATT 
7001 AGAAAAACTC ATCGAGCATC AAATGAAACT GCAATTTATT CATATCAGGA TTATCAATAC CATATTTTTG 
271<PhePheGJ u AspLeuMetL euHfsPheGI nLeuLysAsn MetAspProA snAsplleGI yTyrLysGIn 
7071 AAAAAGCCGT TTCTGTAATG AAGGAGAAAA CTCACCGAGG CAGTTCCATA GGATGGCAAG ATCCTGGTAT 
248< PheLeuArgL ysGlnLeuSe rProSerPhe GluGlyLeuC ysAsnTrpLe ulleAlaLeu AspGI nTyrA 
7141 CGGTCTGCGA TTOCGACTCG TCCAACATCA ATACAACCTA TTAATTTCCC CTCGTCAAAA ATAAGGTTAT 
224<rgAspAI ail eGlyValArg GlyValAspl leCysGlyll ©LeuLysGly GluAspPhel leLeuAsnAs 
7211 CAAGTGAGAA ATCACCATGA GTGACGACTG AATCCGGTGA GAATGGCAAA AGCTTATGCA TITCTTTCCA 
201<pLeuSerPhe AspGI yHisT hrValValSe rAspProSer PheProLeuL euLysHisNte tGluLysTrp 
7281 GACTTGTTCA ACAGGCCAGC CATTACGCTC GTCATCAAAA TCACTCGCAT CAACCAAACC GTTATTCATT 
178< VaJGInGluV alProTrpGI yAsnArgGlu AspAspPfteA spSerAlaAs pValLeuGly AsnAsnfcbtA 
7351 CGTGATTGCG CCTGAGCGAG ACGAAATACG CGATCGCTGT TAAAAGGACA ATTACAAACA GGAATCGAAT 
154<rgSerGI nAI aGI nAI aLeu ArgPheValA rgAspSerAs nPheProCys AsnCysValP rolleSerHI 
7421 GCAACCGGCG CAGGAACACT GCCAGCGCAT CAACAATATT TTCACCTGAA TCAGGATATT CTTCTAATAC 
131<$LeuArgArg LeuPheValA laLeuAlaAs pVallleAsn GluGlySerA spProTyrGI uGluLeuVal 
7491 CTGGAATGCT GTTTTCCCGG GGATCGCAGT GGTGAGTAAC CATGCATCAT CAGGAGTACG GATAAAATGC 
108<GlnPheAlaT hrLysGlyPr olleAlaThr ThrLeuLeuT rpAlaAspAs pProThrArg MePheHisL 
7561 TTGATGGTCG GAAGAGGCAT AAATTCCGTC AGCCAGTTTA GTCTGACCAT CTCATCTGTA ACATCATTGG 
64<ysl leThrPr oLeuProMet PheGluThrL euTrpAsnLe uArgValhtet GluAspThrV atAspAsnAI 
7631 CAACGCTACC TTTGCCATGT TTCAGAAACA ACTCTGGCGC ATCGGGCTTC CCATACAATC GATAGATTGT 
61«aVal$erGly LysGlyHisL ysLeuPheLe uGluProAfa AspProLysG lyTyrLeuAr gTyrlleThr 
7701 CGCACCTGAT TGCCCGACAT TATCGCGAGC CCATTTATAC CCATATAAAT CAGCATCCAT GTrGGAATIT 
38<AlaGlySer G InGlyValAs nAspArgAla TrpLysTyrG lyTyrLeoAs pAlaAspKtet AsnSerAsnL 
7771 AATCGCGGCC TCGAGCAAGA CGTTTCCCGT TGAATATGGC TCATAACACC CCTTGTATTA CTCTITATGT 

K<euArgProAr gSerCysSer ThrGluArgG InlleHisSe rWfet 
7841 AAGCAGACAG TTTTATTCTT CATGATGATA TATTTTTATC TTGTGCAATG TAACATCAGA GATCTIGAGA 
7911 CACAACGTGG CTTTCCCCCC CCCCCCATTA TTGAAGCA7T TATCAGGGTT ATTGTCTCAT GAGCGGATAC 
7981 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG 
8051 ACGTCTAAGA AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT ATCACGAGGC CCTTTCGTCT 
8121 CGCGCGTTTC GGTGATGACG GTGAAAACCT CTGACACATG CAGCTCCCGG AGACGGTCAC AGCTTGTCTG 
8191 TAAGCGGATG CCGGGAGCAG ACAAGCCCGT CAGGGCGCGT CAGCGGGTGT TGGCGGGTGT CGGGGCTGGC 
B261 TTAACTATGC GGCATCAGAG CAGATTGTAC TGAGAGTGCA CCATATGCGG TGTGAAATAC CGCACAGATG 
8331 CGTAAGGAGA AAATACCGCA TCAGATTGGC TATTGG 
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FIGURE 10 



1 


TGGAAGGGCT AA'lTiUVlUJ CAAAAAAGAC AAGAGATOCT 7GATCTGTGG ATCTACCACA CACAACGCTA 


71 


CTTCCCTCAT TGGCAGAACT ACACACCAGG GCCAGGGA7C AGATATCCAC TGACCTTTGG 


ATGGTGCTTC 


141 


AAGTTAGTAC CAGTTGAACC AGAGCAAGTA GAAGAGGCCA AATAAGGAGA GAAGAACAGC 


TTGTTACACC 


211 


CTATGAGCCA GCA7GGGA7G GAGGACCCGG AGGGAGAAGT ATTAGTCTGG AA37TTGACA 


GCCTCCTAGC 


281 


ATTTCGTCAC ATGGCCCGAG AGCTGCATCC GGAGTACTAC AAAGACTGCT GACATCGAGC 


TTTCTACAAG 


351 


GGACTTTCCG CTGGGGACTT TCCAGGGAGG TG7GGCCTGG GCGGGACTGG GGAGTGGCGA 


GCCCTCAGAT 


421 


GCTACATATA AGCAGCTGCT TITIGCCTGT ACIUCJUTCTC TCIX5GTTAGA CCAGATCTGA 


GCCTGGGAGC 


491 


TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA AAGCTTGCCT TGAGTGCTCA AAGTAGTGTG 


561 


TGCCCGTCTG TTGTGTGACT CTGGTAACTA GAGATCCCTC AGACCCl'l'lT AGTCAGTGTG 


GAAAATCTCT 


631 


AGCAGTGGCG CCCGAACAGG GACTTGAAAG CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC 


GCAGGACTCG 


701 


BssHIl (711) 

GCTTGCTGAA GCGCGTacog caazzgzzca crgggrg^cgC c^aATragGa cgrc-asaaai 





Clal (830) 

■771 cseg==:asa ac=a=a=»=C rcOTGCa-j AS^7C^37A rr.-.^C733C- GA3^T7A3k TCGATCGGAA 



841 










AAAA7TCGG7 


7AAGSCCAGG 


GGGAAAGAAA AA.n7A7.™7 


7AAAA2A7A7 A37A7GGGCA AGCAGGGAGC 


911 








Accl (959) 


7AGAACGAT7 


CGCAGT7AA7 


CC7*GGCr?37 7A3.-.-A--.7C 


A3AA3GCTG7 AGnCAAVTAC TGGGACAGCT 


981 


ACAi.CCATCC 


CTTCAGACA3 


GATCAGAAGA AC77AGA7CA 


7TA7A7AA7A CAGTAGCAAC CCTCTATTGT 


1051 


GTGCA7CAAA 


GGA?AGAG=i7 


AAAAGACACr AAGGAAGC77 


7AGACAAGAT AGAGGAAGAG C.AAAACAAAA 


1121 


G7AAGAAAAA 


AGCACAGCAA 


GCAG~-.G77G ACACAGGACA 


CAGCAATCAG GTCAGCCAAA ATTACCCTAT 


1191 


AG7GCAGAAC 


A7CCAGGGGC 


AAATGGTAGA TCAGG3CATA 


TCACC7AGAA CTTTAAACGA TAAGCTTGGG 


1261 


AG7TCCGCG7 


7ACATAAC7T ACGG7AAATG GCCCGC2TGG 


fr, 

CTGACCGCCC AACG^CCCCC GCCCATTGAC 


1331 


GTCAATAA7G 


ACG7A7GTTC 


CCATAG7AAC GCCAA7A 


ACI^ivt-Axr GACGTCAATG GG7GGAGTAT 


1401 


TTACGGTAAA 


CTGCGCACTT 


GGCAGTACAT CAAGTGTATC 


ATATGCCAAG TACGCCCCCT ATTGACGTCA 


1471 


ATGACGGTAA ATGGCCCCCC 


TGGCATTATG CCCAGTACAT GACCTTATGG GAC7TTCCTA CTTGGCAGTA 


1541 


CA^.ACGTA 


TTAGTCATCG 


C7AT7ACCAT GGTGATGCGG 


TTTTGGCAGT ACATCAATGG GCGTGGATAG 


1611 


CGGTTTGACT 


CACGGGGA7T 


TCCAAGTCTC CACCCCATTG 




ieai 


A7CAACGGGA 


C.'i'^CrtAAA TGTCGTAACA ACTCCGCCCC 


ATTGACGCAA ATGGGCGGTA GGCGTGTACG 


1751 


GTGGGAGGTC 


7ATATAAGCA 


GAGrrCGTTT AGTGAACCGT CAGATCGCCT GGAGACGCCA TCCACGCTGT 
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1821 TTTGACCTCC ATAGAAGACA CCGACTCTAG AGgarccATC TAAGTAAGCT TGGCATTCCG GTACTGTTGG 



1891 TAAAATGGAA GACGCCAAAA ACATAAAGAA AGSCCCG3CG CCATTCTATC CTCTAGAGGA TGGAACCGCT 
1961 GGAGAGCAAC TGCATAAGGC TATGAAGAGA TACGCCCTGG TTCCTGGAAC AnTTGCTTTT ACAGATGCAC 



2031 ATATCGAGGT GAACATCACG TACGCGGAAT ACTTCGAAAT GTCCGTTCGG TTGGCAGAAG CTATGAAACG 
2101 ATATGGGCTC AATACAAATC ACAGAATCGT OnA TGCAGT GAAAACTTIC TTCAATTCTT TATGCCGGTG 
2171 TTGGGCGCGT TATTTATCGG AGTTGCAGT7 GCGCCCGCG A ACGACATTTA TAATGAnCGT GAATTGCTCA 
2241 ACAGTATGAA CATTTCGCAG CCTACCGT AG TGTTTGTTTC CAAAAAGGGG TTGCAAAAnA TTTTGAACGT 
2311 GCAAAAAAAA TTACCAVTAA TCCAGAAAnT TATTATCATG GATTCTAAAA CGGAT7ACCA GGGATTTCAG 
2381 TCGATGTACA CGTTCGTCAC ATCTCATCTA CCT CCCGGTT TTAVTGAATA CGATTTTGTA CCAGAGTCCT 
2451 TTGATCGTGA CAAAACAATT GCACTGATAA 7K ATTCCTC TGGATCTACT GGGTTACCTA AGGGTGTGGC 
2521 CCTTCCGCAT AGAACTGCCT GCGT CAGATT CTCGCATGCC AGAGATCCTA TTTTTGGCAA TCAAATCATT 
2591 CCGGATACTG CGATTTTAAG TGTTGTTCCA , TTC CATCACG GTTTTGGAAT GTTTACTACA CTCGGATATT 
2661 TGATATGTGG ATTTCGAGTC GTC TTAATGT ATAGATTTGA AGAAGAGCTG TTTTTACGAT CCCTTCAGGA 
2731 ITACnAA^TT- CAAA3TSCGT TGCTAGTACC AACC CTATTT TCnTTCTTCG CCA^-.GCAC TCTGATTGAC 
2801 ^TACGATT TATCTAATTT ACACGA nATT GCTTCTGGGG GCGCACCTCT TTCGAAAGAA GTCGGGGAAG 
2871 CGGTTGCAAA ACGCTTCCAT CTTCCnGGGA TACG.-C.AnGG ATATGG GCTC ACTGAGACTA CATCAGCTAT 
2941 TCTGATTACA CCCGAGGGGG ATGATAA-.CC GGGCGCGGTC GGTA-AG TTG TTCCATTTTT TGAAGCGAAG 
3011 G ^G1^-TC TGGATACCGG GAAAACGCTG GGCGTTAATC AGAGAGGCGA ATTATGTGTC AGAGGACCTA 
3081 TC1 " TTATG - C3GTTAT3TA AnCA-.TCCGG AAGCGACCA A CGCCTTGATT GACAAGGATG GATGGCTACA 
3151 ™OSftGK: ATAOCTTACT GGGACGAAGA CG.-ACACTTZ TTCATAGTTG ACCGCTTGAA GTCT TTAATT 

Clal (3259) 

3221 ^TACAAAG GATATCAGGT CCCCCCCGCT GAA.TT03.AnT CGATATTGTT A CAACACCCC AACATCTTCG 
3291 ^SO^GCGT GGCAGGTCTT CCCGACGATG A CGCCGGTGA ACTTCCCGCC GCCGTTGTTG TTTTGGAGCA 
5361 CG ^nAGACG ATGACGGAAn AAGAGATCGT GGATTACGTC GCC AGTCAAG TAACAACCGC GAAAAAGTTG 
3431 CGCGGAGa2>G -ntnt;TTTGT GGACGAAGTA CCGA-AGSTC TTACCGGAA A ACTCGACGCA AGAAAAATCA 

Apal (3557) 

^ cataaaggcc aagaaoggco caa^caa Knc J£°!J££ ccS^crJ 35 " 

3571 TAAGACCAAT GACTTACAAG GCAGCTGTAG ATCTTAGCCA CmTTAAAA GAAAAGGGGG GACTGGAAGG 
3641 GCTAATTCAC 7CCZAAAGAA GACAAGATAT CC7TGATCTG TGGATCTACC ACACACAAGG CTACTICCCT 
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3711 GATXGGCAGA ACTACACACC AGGGCCAGGG GTCAGATATC CACTGACCTT TGGATGGTGC TACAAGCTAG 

3781 TACCAGTTGA GCCAGATAAG GTAGAAGAGG CCAATAAAGG AGAGAACACC AGCTICTTAC ACCCTGTGAG 

3851 CCT3CATGGA ATGGATGACC CTGAGAGAGA AGTGTTAGAG TGGAGGTTTG ACAGCCGCCT AGCATTTCAT 

3921 CACGTGGCCC GAGAGCTGCA TCCGGAGTAC TTCAAGAACT GCTGACATCG AGCTTGCTAC AAGGGACTTT 

3991 CCGCTGGGGA CTTTCCAGGG AGGCGTGGCC TGGGCCGGAC TGGGGAG7GG CGAGCCCTCA GATGCTGCAT 

^0^1 A7AAGCAGCT GCTTTTTGCC TGTACTGGGT CTCTCTCGTT AGACCAGATC TGAGCCTGGG AGCTCTCTGG 

4131 CTAACTAGGG AACCCACTCC TTAAGCCTCA ATAAAGCTTG CCITGAGTGC TTCAAGTAGT GTCTCCCCGT 

■ ■ — ► 

<201 CTCTTOliriU ACTCTGGTAA CTAGAGATCC CTCAGACCCT TTTAGTCAGT GTGGAAAATC TCTAGCACCC 

4271 CCCAGGAGGT AGAGGTTOCA GTGAGCCAAG ATCGCGCCAC TGCATTCCAG CCTGGGCAAG AAAACAAGAC 
4341 TGTCTAAAAT AATAATAATA AGTTAAGGGT ATTAAATATA TITATACATG GAGGTCATAA AAATATATAT 
4411 A TTTGG GCTG GGCGCAGTGC CTCACACCTG CGCCCGGCCC TTIGGGAGGC CGAGGCAGGT GGATCACCTG 
4481 AGTTTGGGAG TTCCAGACCA GCCTGACCAA CATGGAGAAA CCCCTTCTCT GTGTATmT AGTAGATTTT 
4551 ATTTTATGTG TATTTTATTC ACAGGTATTT CTGGAAAACT GAAACTCTTT TTCCTCTACT CTGATACCAC 
4621 AAGAATCATC AGCACAGAGG AAGACTICIU TGATCAAATG TGGTGGGAGA GGGAGGTTTT CACCAGCACA 
4691 TGAGCAGTCA GTTCTGCCGC AGACTCGGCG GGTGTCCTTC GGTTCAGTTC CAACACCGCC TGCCTGGAGA 
4761 GAGGTCAGAC CACAGGGTGA GGGCTCAGTC CCCAAGACAT AAACACCCAA GACATAAACA CCCAACAGGT 
4831 CCACCCCGCC TGCTGCCCAG GCAGAGCCGA TTCACCAAGA CGGGAATTAG GATAGAGAAA GAGTAAGTCA 
4901 CACAG AGCCG GCTGTGCGGG AGAACGGAGT TCTATTATGA CTCAAATCAG TCTCCCCAAG CATTCGGGGA 
4971 T E^ G TIIE TAAGGATAAC TTAGTGTGTA GGGGGCCAGT GAGTTGGAGA TGAAAGCGTA GGGAGTCGAA 
5041 GGTCTCCTTT TGCGCCGAGT CAGTTCCTGG GTGGGGGCCA CAAGATCGGA TGAGCCAGTT TATCAATCCG 
5111 GGGGTGCCAG CTGATCCATG GAGTGCAGGG TCTGCAAAAT ATCTCAAGCA CTGATTGATC TTAGGTTTTA 
5161 CAATAGTGAT GTTACCCCAG GAACAATTTG GGGAAGGTCA GAATCTTGTA GCCTGTAGCT GCATGACTCC 
5251 TAAACCATAA TTTCTTTTTT GTTTmTIT UTTTATTTTT GAGACAGGGT CTCACTCTGT CACCTAGGCT 
5321 GGAGTGCAGT GGTGCAATCA CAGCTCACTG CAGCCCCTAG AGCGGCCGCC ACCGCGGTGG AGCTCCAATT 
5391 CGCCCTATAG TGAGTCGTAT TACAATTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG 
5461 CGTTACCCAA CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC 
5531 ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCGA AATTCTAAAC GTTAATATTT 
5601 TGTTAAAAT? CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA TCGGCAAAAT 
5671 CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTGTTCCAG TTIGGAACAA GAGTCCACTA 
5741 TTAAAGAACG TCGAC TCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA CTACGTGAAC 
5811 CATCACCCTA ATCAAGTTTT TTGGGGTCG& GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC 
5B81 CCGATTTAGA GCTTGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG 
5951 GGCGCTAGGG CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC 
6021 CGCTACAGGG CGCGTCCCAG GTGGCACTIT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTITIC 
6091 TAAATACATT CAAATATCTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA 
° GGAAG AGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTmTT GCGGCATTCT GCCTTCCTGT 
5231 TTITGCTCAC CCAGAAACGC TGGTGAAACT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC 
ATC GAACT Gg ATCTCAACAG CGGTAAGATC CTTGAGAG1T TTCGCCCCGA AGAACGTTTT CCAATGATGA 
6371 GCACTnTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG 
6441 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC 
6=11 ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 
III 1 CAAOGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCAIGTAA CTCGCCTTGA 
6651 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATCCC ToTAGCAATG 
6721 GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT 
6791 GGATCGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA 
6561 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC 
t'l; CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA 
j^OOl iAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 
^071 AAAACTTCAT TOAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT 
L^T i/v " CGTCA GT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT 
U.l 7TTTTCTGCG CGTAATCTGC TGCITCCAAA CAAAAAAACC ACCGCTACCA GCGGTCGTTT GTTTGCCGGA 
Hl TCA " GAGCTA CCAACTCTIT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT 
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7351 CTAGTCTAGC CGTAGTTAGG CCACCACTIC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA 
7421 TCCTGTTACC AGTGGCTGCT GCCAGTCGCG ATAAGTCGTG TCTTACCGGG TTCGACTCAA GACGATAGTT 
7491 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGUjbbTlU, TGCACACAGC CCAGCTIGGA GCGAACGACC 
7561 TACACCGAAC TGAGATACCT ACAGCGTCAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG 
7631 ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG 
7701 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTCTCATC CTCGTCAGGG 
7771 GGGCGGAGCC TATG GAAAAA CGCCAGCAAC GCGGCCTTIT TACGGTTCCT GGCCTTTTCC TGGCCTTTTC 
7841 CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTIG AGTCAGCTCA 
7911 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCCAATA 
7981 CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA CGACAGGTTT CCCGACTGGA 
8051 AAGCGGGCAG 7GAGCGCAAC GCAATTAATG TGAGTTAGCT CACTCATTAG GCACCCCAGG CTTTACACTT 
8121 TATGCTICCG GCTCGTATGT TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATCAC 
8191 CATGATTACG CCAAGCTCGG AATTAACCCT CACTAAAGGG AACAAAAGCT GCTGCAGGGT CCCTAACTGC 
B261 CAAGCCCCAC AGTGTGCCCT GAGGCTGCCC CTTCCTTCTA GCGGCTGCCC CCACTCGGCT TTGCTTTCCC 
8331 TAGTTTCAGT TACTTGCGTT CAGCCAAGGT CTGAAACTAG GTGCGCACAG AGCGGTAAGA CTGCGAGAGA 
8401 AAGAGACCAG CTTTACAGGG GGTTTATCAC AGTGCACCCT GACAGTCGTC AGCCTCACAG GGGGTTTATC 
8471 ACATTGCACC CTGACA GTCG TCAGCCTCAC AGGGGGTTTA TCACAGTGCA CCCTTACAAT CATTCCATIT 
8541 GATTCACAAT TTnTTAGTC TCTACTGTGC CTAACTICTA AGTTAAATIT GATCAGAGGT G TCTTCCCAG 
8611 AGGGGAAAAC AGTATATACA GGGTTCAGTA CTATCGCATT TCAGGCCTCC ACCTGGGTCT TCGAATCTCT 
8681 CCCCCGAGGG GTGATGACTA CCTCAGTTGG A1CTCCACAG GTCACAGTGA CACAAGATAA CCAAGACACC 
8751 TCCCAAGGCT ACCACAATGG GCCGCCCTCC ACGTGCACAT GGCCGGAGGA ACTGCCATGT CGGAGGTGCA 
6821 AGCACACCTG CGCATCAGAG TCCTTGGTGT GGAGGGAGGG ACCAGCGCAG CTTCCAGCCA TCCACCTGAT 
8891 GAACAGAACC TAGGGAAAGC CCCAGTTCTA CITACACCAG GAAAGGC 
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FIGURE 11 

1 rcGAAGGGCT AATTTGGTCC CAAAAAAGAC AAGAGA7CCT TGAT CIC7CG A7CIACCACA CACAAGGCTA 
71 CTTCCCTGAT TGGCAGAACT ACACACCAG3 GCCAGGGATC AGAT ATCCAC TGACC777GG ATGGTGCTTe 
141 AAGTTAGTAC CAG77GAACC AGAGCAAGTA GAAGAGGCCA AATAA GGAGA GAAGAACAGC TTGTTACACC 
2U CTATCAGCCA GCATGGGATG GAGGACCOGG AGGGAGAAGT ATTAGTGTGG AAGTTTGACA GCCTCCTAGC 
281 ATTTCGTCAC A7GGCCCGAG AGCTGCATCC GGACTACTAC AAAGAC 7GC7 GACATCGAGC 7TTCTACAAG 
351 G^ACTITCCG CTTGGGGACTT TCCAGGGAGG 7GTGGCCTGG GQGGGACTGG GGAGTGGCGA GCCCTCAGAT 
421 GCTACATATA AGCAGCTGCT TTT7GCCTGT AC7GGGTC7C TCTGGTTAGA CCAGATCTGA GCCTQGGAGC 
491 ^riCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA AAGCTTGCCT 7G AGTGCTCA AAG7AGTGTC 
551 TGCCCGTCTC TTGTGTGACT CTGGTAACTA GAGATCCCTC AGACCCTTT7 AGTCA GTGTG GAAAATCTCT 
631 AGC 7 rG ^ CCCGAACAG^GACTTGAAAG CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC GCAGGACTCG 

BssHil (711) 

701 GCTIGCTGAA GCSJXacg? caagaggcca cg ? gc^gC c^XpaffGa ccrcaa^- ^rgacracc 



77i — ~ - ~ ~ C,aI < 830 > 



sgasagasC ^OTGGSG PZC:T?C-~A 7TA.^C^ GLAaAATTA^-. TCGATGGGAA 
841 AAAAT7CGG7 7AAGGCCAGG GGGAAAGAAG AAG7AGAAGC TAAAGCA2A7 CG7A7GG32A AGCAGGGAGC 



511 


TAa-.AC2.Arr CGCAG77AA7 




Accl (959) 


Cr7^2C7o. 7AGAAACATC 


AGAAGG7TGT 


AGACAAA7AC TGGGACAGCT 


981 


ACAACCA7C7. C7TCAGACAG 








GA7CAGAG3A G7T7CGA7CA 


C7A7ACAACA 


CAG7AGCAAC CCTCTATTGT 


1051 


GTGCACCAGC GGATCGAGA7 


CAAGGACACC AAGGAA3C77 


7AGACAAGA7 


AGAGGAAGAG CAAAACAAGT 


1121 


CCAAGAAGAA GGCCCAGCAG 


G3AGCAG77G ACACAGGACA 


CAGCAATCAG 


G7CAGCCAAA ATTACCCTAT 


1191 


A3TGCAGAAC ATCCAGGGGC 


AAA7GGTAGA 7CAGGCCA7A 


7CACC7AGAA 


C7TTAAACGA 7AAGCTTGGG 


1261 


AGTTCCGCG7 TACA7AACTT 


ACG37AAATG GCCCGCCTGG 


CTGACCGCCC 




AACGACCCCC GCCCATTGAC 


1331 


GTCAATAATG ACGTATGT7C 


CCATAGTAAC GCCAA7AGGG 


ACTTTCCATT 


GACGTCAATG GGTGGAG7A7 


1401 


T7ACGGTAAA CTGCCCACTT 


GGCAGTACAT CAAGTGTATC 


ATATGCCAAG 


TACGCCCCCT ATTGACGTCA 


1471 


A7GACGGTAA ATGGCCCGCC 


TGGCATTATG CCCAGTACAT 


GACCTTA7GG 


GAC7T7CC7A CTTGGCAGTA 


1541 


CATC7ACG7A 7TAG7CATCG 


CTATTACCAT GGTGATGCGG 
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1611 CSGT7TCACT CACGGCGATT TCCAAGTCTC CACCCCATTC ACCTCAATGG GACTTTGTrr TGGCACCAAA 
1681 ATCAACGGSA CTITCCAAAA TGTCGTAACA ACTCCGCC" A rJGACSCAA ATCG3CGGTA GGCGTGTACG 
1751 GTGGGAGGTC 7ATATAAGCA GAGCTCGTTT AGTGAACCGT CAGATCGC CT GGAGAC5CCA TCCACGCTC7 
1821 TTTGACCTCC ATAGAAGACA CCGACTCTAG ^G satccATC TAAGTAAG— TSGCAT^CG GTACTG7T3G 
1891 TAA AATGGAA GACGCCAAAA ACATAAAGAA AGGCCCGSCG CCATTCTATC CTC7AGAGGA TGGAACCGC7 
1961 GGAGAGCAAC TGCATAAGGC TATGA AGAGA 7ACGCCCTGG TTCCTGGAAC AATTGCTTTT ACAGATGCAC 
2031 ATATCGAGGT GAACATCACG TACGCGGAAT ACTTCGAAAT GTCCGTTCGG TTGGCAGAAG C TATCAAACG 
2101 ATATGGGCTG AA7ACAAATC ACAGAATCGT CSTATGCAGT GAAAACTCTC T TCAATTCTT TATGCCGGTG 
2171 TTGGGCGCGT TATTTATCGG AGTTGCAGTT GCCCCCGCGA ACGACA7TTA TAATGAACGT GAATTGCT CA 
22<1 ACAGTATGAA CATTTCGCAG CCTACCGTAG TC r iTO.Tn: CAAAAA OGGG TTCCAAA.AAA TTTTGAACGT 
2311 GCAAAAAAAA TTACCAATAA TCCAGAAAAT TATTATCATC G ATTCTAAAA CGGATTACCA GGSATITCAG 
2381 TCGA7GTACA CGTTCGTCAC ATCTCATCTA CCTCCCGGTT TT AATGAATA CGATTTTGTA CCAGAGTCCT 
24S1 TTCATCGTGA CAAAACAATT GCACTGATAA TGAATTCCTC TGGATCTACT GGGTTACCTA AGGGTCTGGC 
2521 CCTTCCGCA7 AGAACTCCCT GCGTCAGATT CTCGCATGCC AGAGATCCTA TT7TT GGCAA TCAAATCATT 
2591 CCGal ' TACTC CGAnTTAAG T3TTGTTCCA TTCCATCACG GTTTTGGAAT GTTTA CTACA CTCGGATATT 
2651 ^""TGTGG ATTTCGAGTC GTCTTAATCT ATAGATTTGA AGAAG AGCTG TTTTTACGAT CCCTTCAGGA 
2731 TaaiWB CAAAGTGCGT TGCTAGTACC AACCC7ATT7 TCAT7CTTCG CCAAAAGCAC TCTGATTCAC 
2801 ^TACGATT TATCTAATTT ACACGAAAT7 GCTTCTGGGG GCSCACCTCT TTCGA AAGAA GTCGGGGAAG 

2871 CCGTTSCAAA acccwaaT cttccaggga tacgacaagg atatgggctc actgagacta catcagct at 

2941 CCCGAGGGGG ATGATAAACC 0GGC3COGTC GGTAAAG7TC TTCCA7TTTT TGA AGCGAAG 

30U ^^GGA^ TGGATACCGG GAAAACGCTG GGCoTTAATC AGAGAGGCGA ATTATGT GTC AGAGGACCTA 

3081 ^^"TCTC CGGTTATCTA AACAATCC5G AAGCGACCAA CGCCTTGATT GACAA GGATG GATGGCTACA 

3151 T: ' CTCS ^ ATAGCTTACT GGGACGAAGA CGAAGACTTC TTCATAGTTG ACCGrTTGAA GTCTTTAATT 

Clal (3259) 

3221 ~^ TACAAAG ^-"TCAGGT GGCCCCCGrr GAA7TGGAAT CGATATTGTT AGAACACCCC AACATCTTCG 
3291 "^^"^ GGCAGGTCTT CCCGACGATC ACGCCGGTGA ACTTCCCGCC GCCGTTGTTG TTTTGGAGCA 
3361 CG3;j ^CG ATGACGGAAA AAGAGATCGT GGATTACGTC GCCAGTCAAG TAA CAACCGC GAAAAAGTTG 
3<31 "ggKSafi TTGTG7CTGT G3ACGAAGTA CCGAAAGGTC TTACCGGAAA ACTCGACGCA AGAAAAATCA 
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Apal (3557) 
Xhol (3548) Kpnl (3563) 

3501 GAGAGATCCT CATAAAGGCC AAGAAGGGCG GAAAGTCCAA ATTGTAAcTC GAGGGGGGGC CCGGTACCTT 


3571 TAAGACCAAT GACTTACAAG GCAGCTGTAG ATCTTAGCCA CTTTTTAAAA GAAAAGGGGG GACTCGAAGG 



3641 GCTAATTCAC TCCCAAAGAA GACAAGATAT CCTTGATCTG TGGATCTACC ACACACAAGG CTACTTCCCT 

3711 GATPGGCAGA ACTACACACC AGGGCCAGGG GTCAGATATC CACTGACCTT 7GGATGGTGC TACAAGCTAG 

3781 TACCAGTTGA GCCAGATAAG GTAGAAGAGG CCAATAAAGG AGAGAACACC AGC7TGTTAC ACCCTGTGAG 

3851 CCTGCATGGA ATGGATGACC CTGAGAGAGA AGTGTTAGAG TGGAGGTTTG ACAGCCGCCT AGCATTTCAT 

3921 CACGTGGCCC GAGAGCTGCA TCCGGAGTAC TTCAAGAACT GCTGACATCG AGCTTGCTAC AAGGGACTTT 

3991 CCGCTGGGGA CTTTCCAGGG AGGCGTGGCC TGGGCGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT 

4061 ATAAGCAGCT GCTTTTTGCC TGTACTGGGT CTCTC7GGTT AGACCAGATC TGAGCCTGGG AGCTCTCTGG 
fr» . 

4131 CTAACTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG OCTTGAGTCC TTCAAGTAGT GTGTGCCCGT 

■ ■ »» 

4201 CTGTTGTGTG ACTCTGGTAA CTAGAGATCC CTCAGACCCT TTTAGTCAGT GTGGAAAATC TCTAGCACOC 

— 

4271 CCCAGGAGGT AGAGGTTGCA GTGAGCCAAG ATCGCGCCAC TGCATTCCAG CCTGGGCAAG AAAACAAGAC 
4341 TGTCTAAAAT AATAATAATA AGTTAAGGGT ATTAAATATA TTTATACATG GAGGTCATAA AAATATATAT 
4411 ATTTGGGCTG GGCGCAGTGG CTCACACCTG CGCCCGGCCC TTTGGGAGGC CGAGGCAGGT GGATCACCTG 
4481 AGTTTGGGAG TTCCAGACCA GCCTGACCAA CATGGAGAAA CCCCTTCTCT GTGTATTTTT AGTAGATTTT 
4551 ATTTTATGTG TATTTTATTC ACAGGTATTT CTGGAAAACT GAAACTGTTT TTCCTCTACT CTGATACCAC 
4621 AAGAATCATC AGCACAGAGG AAGACTTCTG TGA7CAAATG TGGTGGGAGA GQGAGGTnT CACCAGCACA 
4691 TGAGCAGTCA GTTCTGCCGC AGACTCGGCG GGTGTCCTTC GGTTCAGTTC CAACACCGCC TGCCTGGAGA 
4761 GAGGTCAGAC CACAGGGTGA GGGCTCAGTC CCCAAGACAT AAACACCCAA GACATAAACA CCCAACAGGT 
4831 CCACCCCGCC TGCTGCCCAG GCAGAGCCGA TTCACCAAGA CGGGAATTAG GATAGAGAAA GAGTAAGTCA 
4901 CACAGAGCCG GCTGTGCGGG AGAACGGAGT TCTATTATGA CTCAAATCAG TCTCCCCAAG CATTCGGGGA 
4971 TCAGAGTTTT TAAGGATAAC TTAGTGTCTA GGGGGCCAGT GAGTTGGAGA TGAAAGCGTA GGGAGTCGAA 
5041 GGTGTCCTTT TGCGCCGAGT CAGTTCCTGG GTGGGGGCCA CAAGATCGGA TGAGCCAGTT TATCAATCCG 
5111 GGGGTGCCAG CTGATCCATG GAGTGCAGGG TCTGCAAAAT ATCTCAAGCA CTGATTGATC TTAGGTnTA 
5181 CAATAGTGAT GTTACCCCAG GAACAATTTG GGGAAGGTCA GAATCTTGTA GCCTGTAGCT GCATGACTCC 
5251 TAAACCATAA TTTCTnTIT GTTrTTTTTT TTTTATTTTT GAGACAGGGT CTCACTCTGT CACCTAGGCT 
5321 GGAGTGCAGT GGTGCAATCA CAGCTCACTG CAGCCCCTAG AGCGGCCGCC ACCGCGGTGG AGCTCCAATT 
5391 CGCCCTATAG TGAGTCGTAT TACAATTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG 
5461 CGTTACCCAA CTTAATCGCC TTGCAGCACA TCCCCCTITC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC 
5531 ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCGA AA7TGTAAAC GTTAATATTT 
5601 TGTTAAAATT CGCGTTAAAT TnTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA TCGGCAAAAT 
5671 CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTGTTCCAG TTTCGAACAA GAGTCCACTA 
5741 TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA CTACGTGAAC 
5811 CATCACCCTA ATCAAGTnT TTGGGGTCGA GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC 
5881 CCGATTTAGA GCTTGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG 
5951 GGCGCTAGGG CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC 
6021 CGCTACAGGG CGCGTCCCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTinC 
6091 TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTCAAAAA 
6161 GGAAGAGTAT G^GTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTITTT GCGGCATTIT GCCTTCCTGT 
6231 TITTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TCGCTGCACG AGTGGGTTAC 
6301 ATC GAACT GG ATCTCAACAG CCGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA 
6371 GCACTTTTAA AGTTCTGCTA TG7GGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG 
6441 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC 
6511 ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 
6581 CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA 
6631 TCG7TGGGAA CCGGAGCTGA A7GAAGCCAT ACCAAACGAC GAGCGTGACA CCACGA1XGCC TGTAGCAATG 
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6721 GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT 
6791 GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC OXLIT1UJ U GCTOGCTCGT TTATTGCTGA 
6861 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCACCACTGG GGCCAGATCG TAAGCCCTCC 
6931 CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA 
7001 TAG GTGCCT C ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 
7071 AAAACTTCAT 1TITAATTIA AAAGGATCTA GGTGAAGATC CTTITTGATA ATCTCATGAC CAAAATCCCT 
7141 TAACGTGA GT TTICCnTCCA CTCAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT 
7211 VTTTTCTGCG CGTAATCTGC TGC7TGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA 
7281 TCAAGAGCTA CCAACTCT3T TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACT3TCCTT 
7351 S TAGIGTASC CGTAGTrAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA 
7421 TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCG1G TCTCACCGGG TTGGACTCAA GACGATAGTT 
7491 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTIGGA GCGAACGACC 
7561 TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG 
7631 ACA GGTAT CC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGC GAAACGCCTG 
7701 GTATCTITAT AGTCCIGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTITGTGATG CTCGTCAGGG 
7771 GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCITIT TACGGTTCCT GGCCTTITO: TCGCCTITrc 
7841 CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTOGGA TAACCGTATT ACCGCCTTIG AGTCAGCIGA 
7911 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCCAATA 
7981 CGCAAACCGC CTCTTCCCCGC GCGTK3GCCG A1TCATTAAT GCAGCTGGCA CGACAGGTIT CCCGACIGGA 
8051 AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT CACTCATTAG GCACCCCAGG CTTTACACTT 
8121 TATGCTIXXG GCTCGTATGT TCTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 
8191 CATGATTACG CCAAGCTCGG AATTAACCCT CACTAAAGGG AACAAAAGCT GCTCCAGGGT CCCTAACTGC 
8261 CAAGCCCCAC AGTGTGCCCT GAGGCTGCCC CTTCCTTCTA GCGGCTGCCC CCACTCGGCT TIUCITIXX C 
8331 TAGTTTCAGT TACTIGCGTr CAGCCAAGGT CTGAAACTAG GTGCGCACAG AGCGGTAAGA CTGCGAGAGA 
8401 AAGAGACCAG CTTTACAGGG GGTTTATCAC AGTGCACCCT GACAGTCGTC AGCCTCACAG GGGGTTTATC 
8471 ACATIGCACC CTGACA GTCG TCAGCCTCAC AGGGGGTTTA TCACAGTGCA CCCTTACAAT CATICCATIT 
8541 GATTCACAAT TTTITTAGTC TCTACTGTGC CTAACTTGTA AGTTAAATTT GATCAGAGGT GTGTTCCCAG 
3611 AGGGGAAAAC AGTATATACA GGGTTCAGTA CTATGGCATT TCAGGCCTCC ACC1GGGTCT TGGAATGTGT 
8581 CCCCCGAGGG GTGATGACTA CCTCAGTTGG ATCTCCACAG GTCACAGTGA CACAAGATAA CCAAGACACC 
6/51 TCCCAAGGCT ACCACAATGG GCCGCCCTCC ACGTGCACAT GGCCG5AGGA ACTGCCATGT CGGAGGTGCA 
5S21 AGCACACCTG CGCATCAGAG TCCTTGGTGT GGAGGGAGGG ACCAGCGCAG CTTCCAGCCA TCCACCIGAT 
5891 GAACAGAACC TAGGGAAAGC CCCAGTTCTA CTTACACCAG GAAAGGC 
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FIGURE 12 
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FIGURE 13 
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FIGURE 14 



Reverse transcriptase activity from the Rev- 
Independent gag-pol HIV-1 vector pCMVBNkan 
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FIGURE ISA 
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FIGURE 15B 
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FIGURE 15C 



; Transduction of myeloid cells (U937) 
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FIGURE 15D 



Transduction of human primary macrophages 
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FIGURE 16 
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FIGURE 17 

1 CCTGGCCATT TATCCATATC ATAATA7GTA CATTTftTOrT G32TCATCTC CAACATTOCC 

71 GCCATSTTGA CATTGATTAT TGfiCTOSTTA T7AATACTAA TCAATTACGG GOTCATT&GT? TCATAGCOCA 

141 TATAIOGAOT TCCGCCTTTAC ATAACTTACG GTAAATGGCC CGCTIGOCTG AOTOCCCAAC GAOCCCX3aCC 

211 CATTGACGTC AATAATCACG TWP&TTCOCA TAGTAACGOC AATAGGGACT TOXATTCAC GTCAATOQCT 

281 GGAGTMTTA OSGTAAACTG CXXACTTOGC AOTftCA r tfCAA <7FGTATCATA TO^CAACTFAC GCCCCCTOTT 
" WB *" l>,,WB,B,a, * a " ttl>mM ^ TOOIi,1 " a,a,!m "*" 

351 GACCTCAATC ACGCTA&ATC GCCCGOCIGG C^TTOTGCCC AG'£v>CA r i<GAC CTEATGGGAC TTTCCTACTT 

421 GGCAGTAGAT CTACGTATTA glCATCGCTA TTAO^PGGT GATGCGGTTT TGGCAGmCA TCAATGGGCO 
491 TGGATAGCGG TTI^ACFCAC GGGGATITCC AAGTCTCCAC CCCATPGACG TCAATOGGAG TTTOPTTTOQ 
561 CACCAAAATC AACGGGACTT TCC&AAATCT CGTCAAC&ACT CCGCCCCAIT GACGCAAATC GGCOGTAGGC 
631 GTCTACGGTC CTCAGGICTAT ATAAGCAGAG CTOTJTrACT GAACCCTCAG ATCOCC7GGA OACOCCA'TCC 

BssHII (7S2) 
Sacll (746) 

701 ACGCTGTTTTT CffiCCKXATA GAfiCX^CaCCG CK5^(^TOC AGCCTCCGCX? C &OGCQCTAA GTAT^GGATG 

771 TCTTGGGAAT CRGCTGC1TA TCGCCATCTT CCTITTAACTr OTCTATGGGA TOIWITCEfcC TC^TATCT^ 
3>> L G N OLL I A I L L L 8 V Y G IYCT LYV*** 
841 ACACnCOTTT ATCGTCTACC AGCTPGGAGG AATC5CGACAA TPC'CCCTCTT TTGTGCAACC AAGftATAGGG 

27> T V F YGVR A W R NAT 1PLF CAT K N R 
911 ATACITGGGG MCAACTCAG TGOCTSACCAG A7AAQV3GTGA 1TATI1CAGAA GT&GCCCTrA ATTOTAC^i 
50KDTWQ T.TQ CLP DNGD YSE VAL N V T E 
981 AAGCITTCAT GCC?I\3GAA7A ATAOAtTTCAC AGAAOAOGOA ATAGAGGATC TA1X3GCAACV CTTTOAGACr 
73* S F D A W N NTVT 5 Q A IE0 VWQL FET 
1051 TCAATOAAGC CMTGTGTAAA ATTATQCCCA T7ATCCATTA CTATG&GATG CAATAAAAG7 GAC&CAGATA 

97> S i' K P C V K LSP LCI TMRC N KS ETD 
1121 GATGGGGATT GACAAAATCA ATAACAACAA CAGCATCAAC AACATCAACG ACAGCATCAG CAAAAGTAGA 
120> RWGL TKS ITT TAST T S T T A S AKVD 
1191 CAlGGi>CAAT GAGACTAGTT CHOTAmGC OTAGGATAAT TGCACAGGCT TO3AACAAGA C^AAATGATA 

143> M VM E T S SCIA QDN CTG LEQt OMI 
1261 AGCTOTAAAT 'PCAACATGAC AGGGTTAAAA AGAGACAAGA AAAAAGAGTA CAATGAAACT TGCTACPCTG 
167> SCK FNMT GLK RDK K KEY NET W Y 5 * 
1331 CAGAnTGGT ATGTCAACAA GGCAATAACA CTCCTTAATGA AAGTAOATGT TAOTTGAACC ACTCTAACAC 
19D»A 0 L V CEQ GNN TGNE SRC YMN HCN7 
1401 TTCTGITATC C?*AGAJ?rCTT CTGACAAACA TTA'ITGGGAT GCTATTAGAT TTAGGTATTG 1SGCACCTCGA 
213> SVI GE3 C DKH Y W D AIR FRYC APP* 
1471 GGTTATGCTO TAATCAGACA AATTATTCAG GCTTOJTGCC TTLAATOTrC^ AAGGTO7T3G 

237* GYA LLR C NDT NYS GFMP KCS K V V 
1541 TCTCT1CATG CACAAGGATG ATtSGAGACAC AOAOTCTAC' ITCffJTTGGC TTrAATQGAA CTAGACX2AGA 
;i60^V SSC TRNfl MET QTBT.WFG FNG TRAE 
1611 AAATAGAACT TATATTPACT GGCATGGTAG GGATAATAGG ACTAT^ATTA. GTTTAAATAA GTAT7ATAAT 

?.83^ NRT YIY W HGR DNR Til SLNK Y Y N 
1'681 CPAACAAT-'iA AATGTTA?3AAG ACCAGGAAAT AAGACACTITr TACCACnCAC CA r ITATGTCr GG^l'IOTH^ 
307^ LTM KCRR PGN KTV LPVT IMS GLV 
1751 TCCACTC'ACA ACCAATCAAT GniTAGGCCAA AT^GCATG GTGTOJGTn' GG^GGAAA/iT GG^r/aWSlTC 
330^-F HSQ PI N DRP KQAW CWF GGK WKDA 
1821 AATAAAAGAG GTCAAGCAGA CCATTGTCAA 'ACA1CCXIAGG TATACTGGAA CTAACAATAC TGATAAAATC 

253^ IKE VKQ TIVK HP R YTG TNNT DKl 
1891 AA11TCACGG CrCCTOGAO'J ACX&GATCCG GAAGTTACO* TCATGTGGAC AAA'JTGCAGA OC^AG^GTr^ 
377K N LT APGG GDP EVT FMWT NCR QEF 
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1961 TCTACTGTAA AATGAATTCG TTTCl^AAATT G3OTAGAAGA TAGGAATACA GCTAAOCAGA ACCCAAAQGA 

400»L YCK M N W F L N WVED R N T A N Q K ? K E 
2031 ACAGCATAAA AQGAATTACG TCCCATOTCA TATTAGACAA ATAATCAACA CTO3GEATAA AflTOiSGCLAAA 

423* Q HK RNY VPCH I R Q I I N TWHK VGK 
2101 AATGOTTATT TOCCTOCAAG AGAGGGAGAC CTCACGTCTA AdCCACAGTT tZMCAGTCTC AOfcflCAAACA 

447*- NVY LPPR.EGD LTC N S T V TSL IAN 
2171 TAGATOGGAT 'IK2ATGGAAAC CAAACEAATA TCAOCATGAG TGCAGAOGTG GCAGAACTGT ATOGATTGGA 

470M OWI DGNGTN ITMSAEVAEL YRLE 
2241 ATTGQGAGAT TATAAATTAG TAGAGA1CAC TCCAA'rFGGC TCH3CCOCCA CAGATGTCAA GAGC7TACACT 

493> L G D YKL VEIT PiGLAP TDVK RYT 
2311 ACTGGTGGCA OCTCAAGAAA TAAAAGAGGG GlXTTTrOTOC TAGGGf^PC , ^^ , OXTmTCTC GCAACGGCAO 

5!7> TGG TSRN KRG VFV LGFL G F L , A T A 
2381 CTTCTOCAAT GOIAGOCGOC AGCCTCAOCC TCAOGGCACA CTCCOGAACT TEATTOGCTG GGA3&GT0CA 

540>G SAW GAA S L T LTAQSRT LLA GrVQ 
2451 ACAGCAGCAA CAGCTCTTGG ACOTGC7ICAA GAGACAACAA GAATKTTTGC CSACTG&GCOT CTCGGOAACLA 

563* QQQ GLL OVVK R Q Q ELL RLT V WGT 
2521 AAGAACCTCC ACACTAGGGT CACTGOCATC GAGAAGTACT TAAAGGACCA GCXX3CAGCTG AATCCTTOGG 

S87> KNL QTRV T A I £ K Y LKOQAQL N A W 
2591 GATGTCCGOT TAGACAAOZC TCOCACACTA CTGTACCATG GCCAAATGCA AGTCTAACAC CKAAGTGGAA 

610* G CAF RQV CHT TVPW P N A SLT PKWN 
2661 CAATGAGAOT 'K3GCAAGAGT GGGAGCGAAA GGTTGACiTC TFGGAAGAAA ATATAACAGC CCrcCTAGAfi 

633* NET WQE WERK VDF LEE NITA LLE 
2731 GAGGCACAAA TOCAACAAGA GAAGAACA7G TASUWTTAC AAAAOTTCAA TAGCTGGGAT GTCTTTGGCA 

65?^ EAQ IQQE KNM YEL QKLN SVVD VFG 
2801 AITGCOTTGA CXTITGePTCT TGGATAAAGT ATATACAATA TGGftGfTCrAT ATAGTKJTAG GAGTAATACT 

680^N WFO LAS W I K YIQY GVY fVV GVIL 
2871 GTTAAGAATA CTGATCEfVPA TAGTACAAAT GCTAGCTAAG TTAAGGCAGG GCTATAGGCC AGICTPTTCT 

703> LRI V!Y IVQM LAKLRQ GYRP V F S 
2941 TCCCCACCCT CITAT1TOCA GCAGA0XAT ATCCAACAGG ACCCGGCAC? GKXAACCAGA GAAGGCAAAC 

727> SPP SYFOQTH IQ ODPAL PTR EGK 
3011 AAAGAT^ACOG TCGAGAAGC3C GSTOGCAACA GCaXX.IWOC 1TCGCAGATA GAATATATOC ACrmVrrrAT . 

750>E ROG GEG GGN SSWP WQi EYI HFLI 
3081 TCG7CAGCTT ATTAGACTCT T3ACTTCGCT A7TCACTAAC TCTAGGACTT TOCTATOGAG AGTATACCAG 

773> RQL i R L LTWL FSN CRT LLSR V Y Q 
3151 ATCCTCCAAC CAATACPCCA G^GGCTCICT GCGACOCTAC AGAGSATTCG AGAAGTOC1C AGGACTGAAC 

797* i L Q PILO ft L S ATU QRIR EVL RTE 
3221 TGACCPACCT ACAATATCGG TGGAGCEATT TCCATGAGGC GGTCCAGGCC GTCTGGAGAT C1T3GCGACCG 

820^L TYL QYGW3Y FHEA V Q A VWR SGDR 
3291 ACACiAGACTC TTCCGGGCGC aiVSGGGAGAC TOATCGJGAGA CICTTAGMA CTCTTCOGGG CGCGTGGGGA 

843^ ORL LRA RGE T Y G R LLE TLAGAW-G 
3361 G/vCTOAIXWG AGACTCITAG GAGAGGAGAG CTGGAAGATG GATACTCGAG GTGGAAGATG GAT?vCTCGCC 

867^.D L W ETLR RGE VED GYSR WKM DTR 
3431 AAATCCATCC GCAG»3AGGAT TAGACAAGG*5 CCTGAGCTCA CTCrCTTGTG A^XAGGAGGA TTAGACAAGG 

890^Q I H P Q E D 

BamHI (3530) 

3501 GCTTGA^CTC ACTCTCinOT GAGGGACAGA GG^CAGAgg atccaccagt tj:cagaCTCG AGGGGGGGCC 
3571 OGGTACGAGC OCTPAGOrAG CT^AGAOCA CCTOCCCTGC GAGCTAAGCT GGACAGCCAA TGACGGGTAA 



3641 GAGAGTGACA nTTTCACTA ACCTAAGACA GGAGGGCCGT CAGAGCTACT GOCTAATCCA AAGACGGCTA 



3711 7%AAGTGATAA AAATGTATCA CTCCAAOCTA AGACAGGCGC AGCTICCGAG GGATITGTCG TCTCTTTTAT 



3781 ATATATTTAA AAGGGTCAOC TCTCCGGAGC CGTGCTGCGC GGATGATGTC TTGGTCTAGA CTCGAGGGGG 
3fi51 GGCCCGGTAC GATCCAGATC mTTOTGCXTT TCI'AGTi^CXV ^ICA^sXfi' 'VCTVVmCCCC. l VC\XXOTir>C 



3921 CTIX^CTTGAC CCYGGIU^XZJV -3CCAC1CCCA iTIVICCTTIC «n , AVl 1 AAA.Vl > GAGGAAAITrO CIKltVCKPVO 
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3991 TW*ftC?TAs3C> TTSX3mT?i% rra\X3GGGG ItX^TFJOOG CAQ&CAGC& £Q3CX5GAGGn TltsaGAAGAe 



Kpnl (4097) 

4061 A/aTAGCAGGC ASXCTGGGCSA ^COCTrcGGC 1S2TATO'.^TA CCCACCTGCf? GAAGAATTGA CCrOSGTTOCT 

4131 arriXx:ccA„G aaa^acscsu? gcj^oatxcc TOtrcTOTGA ac^ccrOT- cd^CGCvX^cr craurrracyr 

4201 rCCAGCCCCA CTCATAGGA*? ACTCATAGCr GAGGAC^GOT- CCGKXTTCAA 'ItXXt^jCOCGC raAA£ft%OTT 
flgaB8W^»iiiMii i ii!M^.m.i..'. , ni,.Li.r,TrriiL, lT i rmirrnmnT i nMi iii f iii i i in'W 



4271 OCAT^DGTCT CnXCfOOCT CATO&COCA CCAAAOCATUl OCTAGCCTCC A^V3TC»&\ AOAA?»TTAAA 
4341- CTSlTT/AOTti C4GftCC3G$£A GAAAA'PGCCT COAACAIVnK GAGAJiAAATC 



4411 TTCCGCTTCC TCGCTCACTG ACTCX3CTGCX3 CTOTjTCGTT CGGCTGOGGC GAGCGGTATC 



4481 
4551 
4621 
4691 
4761 
4831 
4901 
4971 
5041 
5111 
5181 
5251 
5321 
5391 
5461 
5531 
5601 
5671 



AGCTCACTCA 
AAAGGCCAGC 
CTGACGAGCA 
GGCGTTFCCC 
GCCTTTCTCC 
TCGITCGCTC 
CTATCGTCTT 
AGCAGAGCGA 
GGACAGTATT 
CGGCAAACAA 
GGATCTCAAG 

GGAirrrGGT 

ATCAATCTAA 
TCAGCGATCT 
CGTGAAGAAG 
CACGGTTGAT 
GTCTQCGTTG 
CGCCCTCCCG 



5741 CTCATCGAGC 

2694 E D L 
5811 CGTTTCTCTA 

2454 R K Q L 
5881 CXIATTCCGAC 

222^ I G V 
5951 GAAATCACCA 

1994 F D G 
6021 TCAACAGGCC 

1754 E V P W 
6091 GCGCCTGAGC 

1524 A Q A 
6161 GCGCAGGAAC 

1294 R L F 
6231 GCTGTTTPCC 

1054 A T KG 
6301 TCGGAAGAGG 
.824 PL? 
6371 ACCTITGCCA 

594 G KG 
6441 GATTGCCCGA 
354S Q G V 
6511 GCCTCGAGCA 

124 R S C 
6581 CAGTTTTATT 



AAGGCGGTAA 
AAAAGGCCAG 
TCACAAAAAT 
CCTGGAAGCT 
CTTCGGGAAG 
CAAGCTCGGC 
GAGTCCAACC 
GGTATCTAGG 
TCGTATCTGC 
ACCACOGCTG 
AAGATOCTTO 
CATGAGATTA 
AGTATATATG 
GFCTATFTCG 
CTGTTGCTGA 
GAGAGCTTTG 
TCGGGAAGAT 
TCAAGTCAGC 

ATCAAATGAA 
M L H F 
ATGAAGGAGA 

S P S 
TCGTCCAACA 

R G V 
TCAGTGACGA 
H T V V 
AGCCATTACG 

G N R 
GAGACGAAAT 

L R F 
ACTGCCAGCG 
V A L A 
GGGGGATCQC 

P I A 
CATAAATTCC 
M F E 
TCTTTCAGAA 
H K L F 
CMTATCGCG 

N D R 
AGACGTTTCC 

S T E 
GMTCATGATG 



TACGGTTATC 
GAACCCTAAA 
CGAOGCPCAA 
CCCTCGTOCG 
CGTCGCGCTT 
TCTGTGCACG 
CGGTAAGACA 
CGGTCCTACA 
GCTCTGCTGA 
GTAGCGGTGG 
GATCTTTTCT 
TCAAAAAGGA 
AGTAAACTPG 
TTCATCCATA 
CTCATACCAG 
TTGTAGCTGG 
GCGTGATCTG 
GTAATGCTCT 

ACTGCAATTT 

Q L K 
AAACTCACCG 

F E G 
TCAATACAAC 
D I C G 
CTGAATCCGG 

S D. P 
CTCGTCATCA 

E D D 
ACGCGATCGC 
V R D S 
CATCAACAAT 

D V I 
AGTGGTGAGT 

T T L 
GTCAGCCAGT 
T L W N 
ACAACTCTGG 

L E P 
AGCCCATTTA 
A W K 
CGTTGAATAT 
R Q I H 
ATATATTTTT 



CACAGAATCA 
AAGGCCGCGT 
GTCAGAGGTG 
CTCTCCTGTT 
TCTCAATGCT 
AACOCCCCGT 
CGACTTATCG 
GAGTTCTTGA 
AGCCAGTTAC 

wmw 

ACGGGGTCTG 
TCTFCAOCTA 
GTCTGACAGT 
GTTOCCTGAC 
GCCTGAATGG 
ACCAGTTGCT 
ATCCTTCAAC 
GCCAGTGTTA 

ATTCATATCA 

NMD 
AGGCAGTTCC 
L C N W 
CTATTAATTT 

I L K 
TGAGAATGGC 

S F P 
AAATCACTCG 
F D S A 
TGTFAAAAGG 

N F P 
ATTTTCACCT 

N E G 
AACCATGCAT 
L W A D 
TTAC7TCTGAC 

L R V 
CGCATCGGGC 

A D P 
TACOCATATA 
Y G Y L 
GGCTCATAAC 

S M 
ATCT1OTGCA 



GGGGATAACG 
TGCTGGOGTT 
GCGAAACCCG 
CCGAOCCTGC 
CACGCTGTAG 
TCAGOCOGAC 
CCACTCGCAG 
AGTGCTGGCC 
C1TCGGAAAA 
TGCAAGCAGC 
ACGCTCAGTG 
GATCCITTTA 
TACCAATGCT 
TCCGGGGGGG 
CCCCATCATC 
GATTTTGAAC 
TCAGCAAAAG 
CAACCAATTA 

GGATTATCAA 
P N D I 
ATAGGATGGC 

L I A 
CCCCTCGTCA 

G E D 
AAAAOCTTAT 
L L K H 
CATCAACCAA 

D V L 
ACAATTACAA 

C N C 
GAATCAGGAT 
S D P Y 
CATCAGGAGT 

DPT 
CATCTCATCT 
MED 
TTCCCATACA 
K G Y L 
AATCAGCATC 

DAD 
ACCCCTTGTA 



GAGGAAAGAA 
TITOCATAGG 
ACAGGACTAT 
OQCTTAOCGG 
GTATCTCAGT 
CGCTGCGCCT 
CAGCCACTGG 
TAACTAOGGC 
AGAGTTGGTA 
AGATTAOGCG 
GAAGGAAAAC 
AATTAAAAAT 
TAATCACTGA 
GGGGGCGCTG 
CAGCGAGAAA 
TTITGCTrrG 
TTCGATTTAT 
AOCAATTCTG 

TACCATATTT 

G Y K 
AAGATCCTGG 

L D Q 
AAAATAAGGT 
F I L N 
GCAITTCTTT 

M E K 
AOCGTTATTC 

G N N 
ACAGGAATCG 
V P I S 
ATTCTTCTAA 

EEL 
AOGGATAAAA 

R I F 
GTAACATCAT 
T V D N 
ATCGATAGAT 

R Y I 
CATGTTQGAA 

M N S 
TTACTGTTTA 



CATGTGAOCA 

CTOGGOCOCC 

AAAGATACCA 

ATACCTGTCC 

TCGGTGTAGG 

TATCCGGTAA 

TAACAGOATT 

TACACTAGAA 

GCTCTTGATC 

CAGAAAAAAA 

TCACGTTAAG 

GAAGTTTTAA 

GGCACCTATC 

AGCTCTGCCT 

GTGAGGGAGC 

CCACGGAACG 

TCAACAAAGC 

ATPAGAAAAA 

2714 F F 

'ITGAAAAAGC 

Q F L 
TATCGGTCTG 
Y R D A 
TATCAAGTOa 

D L S 

CCAGACIT3T 

W V Q 
ATPCGTGATT 
M R S Q 
AATCCAACCG 
H L R 
TACCTGGAAT 

V Q F 
TGCTTGATCG 
H K I T 
TGGCAACGCT 

A V S 
TGTCGCACCT 

TAG 
TTTAATCGG.l 
N L R P 
TGTAAGCAGA 



ATGTAACATC AGAGATTTTG AGACACAACG 
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6651 TGGCT TTCCC 

6721 AATGTATTTA 

6791 AGAAAOCATO 

6861 TTCGCTGATC 

6931 ATOCCGGGAG 

7001 TGCGGCATCA 

7071 AGAAAATACC 
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OQCOCOCCCA TTATTGAAGC ATTFATCAGG 
GAAAAATAAA CAAATAGGGG TTCQGCGCAC 
A3TATCATCA CATTAACCTA TAAAAATAGG 
ACGGTGAAAA CCTCTGACAC ATGCAGCTOC 
CAGACAAGCC CGTCAGGGCG CGTCAGCGGG 
GEAC3CAGATTG TACTGAGAGT (XAOCATATC 
GCATCAGATT GGCTATTGG 



GTTATTCTCT CATGAGOGGA TACATATTPG 
ATTTCCCCGA AAAGTGOCAC CTCACGTCTA 
CGTATCACGA GOCCCTITCG TCTCGCGOGT 
CGGAGACGGT CACAGCITGT CTCTAAGCGG 
TGTTGGCGGG TCPOQGOGCT GGCTTAACTA 
CGGTGfTGAAA TOCCX3CACAG ATCCGTAAGG 



